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Human Sel-10 Polypeptides and Polynucleotides that Encode Them 

CROSS-REFERENCE TO RELATED APPLICATIONS 

5 This application claims the benefit of the following provisional application: U.S. 

Serial No. 60/068,243, filed 19 December 1997, under 35 USC 119(e)(1). This application 
also claims the benefit of the following application: U.S. Serial No.09/213,888, filed 17 
December 1998, under 35 USC 120. 

10 FIELD OF THE INVENTION 

The present invention provides isolated nucleic acid molecules comprising a 
polynucleotide encoding either of two alternative splice variants of human sel-10, one of 
which is expressed in hippocampal cells, and one of which is expressed in mammary cells. 
15 The invention also provides isolated sel-10 polypeptides. 

BACKGROUND OF THE INVENTION 

Alzheimer's disease (AD) is a degenerative disorder of the central nervous system 
20 which causes progressive memory and cognitive decline during mid to late adult life. The 
disease is accompanied by a wide range of neuropathologic features including extracellular 
amyloid plaques and intra-neuronal neurofibrillary tangles. (Sherrington, R., et al; Nature 
375: 754-60 (1995)). Although the pathogenic pathway leading to AD is not well 
understood, several genetic loci are known to be involved in the development of the disease. 
25 Genes associated with early onset Alzheimer's disease (AD) have been identified by 

the use of mapping studies in families with early-onset AD. These studies have shown that 
genetic loci on chromosomes 1 and 14 were likely to be involved in AD. Positional cloning 
of the chromosome 14 locus identified a novel mutant gene encoding an eight- 
transmembrane domain protein which subsequently was named presenilin-1 (PS-1). 
30 (Sherrington, R., et al; Nature 375: 754-60 (1995)). Blast search of the human EST 
database revealed a single EST exhibiting homology to PS-1, designated presenilin-2 (PS-2) 
which was shown to be the gene associated with AD on chromosome 1. (Levy-Lahad, E. et 
al, Science 269:913-911 (1995); Rogaev, E. I., et al, Nature 376: 775-8 (1995); Li, J. et 
al, Proc. Natl Acad. Sci. U.S.A. 92: 12180-12184 (1995)). 

-1- 



6142.N2CP 0 



Mutations in PS-1 and PS-2 that are associated with Alzheimer's disease are 
primarily missense mutations. Both PS-1 and PS-2 undergo proteolytic processing, which 
can be altered by the point mutations found in familial Alzheimer's disease [Perez-Tur, J. et 
al, Neuroreport 7: 297-301 (1995); Mercken, M. et al, FEBS Lett. 389: 297-303 (1996)]. 
5 PS-1 gene expression is widely distributed across tissues, while the highest levels of PS-2 
mRNA are found in pancreas and skeletal muscle. (Li, J. et al, Proc. Natl. Acad. Set U.S.A. 
92: 12180-12184 (1995); Jinhe Li, personal communication). The highest levels of PS-2 
protein, however, are found in brain (Jinhe Li, personal communication). Both PS-1 and 
PS-2 proteins have been localized to the endoplasmic reticulum, the Golgi apparatus, and 

10 the nuclear envelope. (Jinhe Li, personal communication; Kovacs, D.M. et al, Nat. Med. 
2:224-229 (1996); Doan, A. et al, Neuron 17: 1023-1030 (1996)). Mutations in either the 
PS-1 gene or the PS-2 gene alter the processing of the amyloid protein precursor (APP) 
such that the ratio of A-betai-42 is increased relative to A-betai^o (Scheuner, D. et al, Nat. 
Med. 2: 864-870 (1996)). When coexpressed in transgenic mice with human APP, a similar 

15 increase in the ratio of A-betai-42 as compared to A-betai-40 is observed (Borchelt, D. R. et 
al, Neuron 17: 1005-1013 (1996); Citron, M. et al, Nat. Med. 3: 67-72 (1997); Duff, K. et 
al, Nature 383: 710-713 (1996)), together with an acceleration of the deposition of A-beta 
in amyloid plaques (Borchelt et al, Neuron 19: 939 (1997). 

Despite the above- described observations made with respect to the role of PS-1 and 

20 PS-2 in AD, their biological function remains unknown, placing them alongside a large 
number of human disease genes having an unknown biological function. Where the function 
of a gene or its product is unknown, genetic analysis in model organisms can be useful in 
placing such genes in known biochemical or genetic pathways. This is done by screening for 
extragenic mutations that either suppress or enhance the effect of mutations in the gene 

25 under analysis. For example, extragenic suppressors of loss-of-function mutations in a 
disease gene may turn on the affected genetic or biochemical pathway downstream of the 
mutant gene, while suppressers of gain-of-function mutations will probably turn the pathway 
off. 

One model organism that can be used in the elucidation of the function of the 
30 presenilin genes is C. elegans, which contains three genes having homology to PS-1 and PS- 
2, with sel-\2 having the highest degree of homology to the genes encoding the human 
presenilins. Sel-12 was discovered in a screen for genetic suppressers of an activated notch 
receptor, /m-12(d) (Levitan, D. et al, Nature 377: 351-354 (1995)). Lm-12 functions in 
development to pattern cell lineages. Hypermorphic mutations such as lin- 12(d), which 
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increase lin-Yl activity, cause a "multi- vulval" phenotype, while hypomorphic mutations 
which decrease activity cause eversion of the vulva, as well as homeotic changes in several 
other cell lineages (Greenwald, I., et al, Nature 346: 197-199 (1990); Sundaram, M. et al, 
Genetics 135: 755-763 (1993)). Sel-12 mutations suppress hypermorphic im-12(d) 
5 mutations, but only if the lin- 12(d) mutations activate signaling by the intact lin- 12(d) 
receptor (Levitan, D. et al, Nature 377: 351-354 (1995)). Lin- 12 mutations that truncate 
the cytoplasmic domain of the receptor also activate signaling (Greenwald, I., et al, Nature 
346: 197-199 (1990)), but are not suppressed by mutations of sel-12 (Levitan, D. et al., 
Nature 377: 351-354 (1995)). This implies that sel-12 mutations act upstream of the /in- 12 

10 signaling pathway, perhaps by decreasing the amount of functional lin- 12 receptor present in 
the plasma membrane. In addition to suppressing certain lin- 12 hypermorphic mutations, 
mutations to sel-\2 cause a loss-of-function for egg laying, and thus internal accumulation of 
eggs, although the mutants otherwise appear anatomically normal (Levitan, D. et al, Nature 
377: 351-354 (1995)). Sel-12 mutants can be rescued by either human PS-1 or PS-2, 

15 indicating that sel-12, PS-1 and PS-2 are functional homologues (Levitan, D., et al, Proc. 
Natl Acad. Sci. U.S.A 93: 14940-14944 (1996)). 

A second gene, sel-10, has been identified in a separate genetic screen for 
suppressors of lin- 12 hypomorphic mutations. Loss-of-function mutations in sel-10 restore 
signaling by lin- 12 hypomorphic mutants. As the lowering of sel-10 activity elevates lin- 12 

20 activity, it can be concluded that sel-10 acts as a negative regulator of lin- 12 signaling. Sel- 
10 also acts as a negative regulator of sel-12, the C. elegans presenilin homologue (Levy- 
Lahad, E. et al, Science 269:913-911 (1995)). Loss of sel-10 activity suppresses the egg 
laying defect associated with hypomorphic mutations in sel-12 (Iva Greenwald, personal 
communication). The effect of loss-of-function mutations to sel-10 on Zin-12 and sel-12 

25 mutations indicates that sel-10 acts as a negative regulator of both lin-12/notch and 
presenilin activity. Thus, a human homologue of C. elegans sel-10 would be expected to 
interact genetically and/or physiologically with human presenilin genes in ways relevant to 
the pathogenesis of Alzheimer's Disease. 

In view of the foregoing, it will be clear that there is a continuing need for the 

30 identification of genes related to AD, and for the development of assays for the identification 
of agents capable of interfering with the biological pathways that lead to AD. 
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SUMMARY OF THE INVENTION 

The present invention provides isolated nucleic acid molecules comprising a 
polynucleotide encoding human sel-10, which is expressed in hippocampal cells and in 
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mammary cells. Unless otherwise noted, any reference herein to sel-10 will be understood 
to refer to human sel-10, and to encompass both hippocampal and mammary sel-10. 
Fragments of hippocampal sel-10 and mammary sel-10 are also provided. 

In a preferred embodiment, the invention provides an isolated nucleic acid molecule 
5 comprising a polynucleotide having a sequence at least 95% identical to a sequence selected 
from the group consisting of: 

(a) a nucleotide sequence encoding a human sel-10 polypeptide having 
the complete amino acid sequence selected from the group consisting of SEQ ID NO:3, 
SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, and SEQ ID NO:7, or as encoded by the 

10 cDNA clone contained in ATCC Deposit No.98978; 

(b) a nucleotide sequence encoding a human sel-10 polypeptide having 
the complete amino acid sequence selected from the group consisting of SEQ ID NO: 8, 
SEQ ID NO:9, and SEQ ID NO: 10, or as encoded by the cDNA clone contained in ATCC 
Deposit No. 98979; and 

15 (c) a nucleotide sequence complementary to the nucleotide sequence of 

(a) or (b). 

In another aspect, the invention provides an isolated nucleic acid molecule 
comprising a polynucleotide which hybridizes under stringent conditions to a polynucleotide 
encoding sel-10, or fragments thereof. 

20 The present invention also provides vectors comprising the isolated nucleic acid 

molecules of the invention, host cells into which such vectors have been introduced, and 
recombinant methods of obtaining a sel-10 polypeptide comprising culturing the above- 
described host cell and isolating the sel-10 polypeptide. 

In another aspect, the invention provides isolated sel-10 polypeptides, as well as 

25 fragments thereof. In a preferred embodiment, the sel-10 polypeptides have an amino acid 
sequence selected from the group consisting of SEQ ID NO:3, 4, 5, 6, 7, 8, 9, and 10. 
Isolated antibodies, both polyclonal and monoclonal, that bind specifically to sel-10 
polypeptides are also provided. 

30 BRIEF DESCRIPTION OF THE FIGURES 

Figures 1A and IB: Figures 1A and IB are western blots showing protein expression in 
HEK293 cells transfected with PS1-C-FLAG, 6-myc-N-sel-10, and APP695NL-KK cDNAs. 
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Figures 2A and 2B: Figures 2A and 2B are northern blots. Figure 2A is a multiple tissue 
northern blot probed with the common, mammary sel-10 mRNA showing ubiquitous 
expression of the 6.5- and 4.5-Kb transcripts. Figure 2B is a multiple tissue northern blot 
showing limited expression of the hippocampal sel-10 mRNA only in brain. 

5 Figures 3A, 3B and 3C. Figures 3A, 3B and 3C are western blots. Figure 3A 

demonstrates that SEL-10-myc forms a complex with PS1 as shown by immunoprecipitation 
with anti-PS 1 loop antibody. HEK293 cells were transfected with constructs containing 
SEL-10-myc or PS1, cotransfected with both constructs, transfected with the corresponding 
vector only control (pcDNA3 or pCS, respectively), or mock transfected without DNA. 

10 Cultures were treated with lactacystin (12 uM) to inhibit proteasome function as indicated. 
The immunoprecipitates were loaded on the left side of the gel and cell lysates on the right. 
Controls for immunoprecipitation were a nonspecific IgG and just anti-PS 1 loop antibody 
with protein G beads with no cell lysate ("no protein"). The immunoblot was probed with 
anti-myc antibody to detect SEL-10-myc in the immunoprecipitates and cell lysates. SEL- 

15 10-myc is expressed in lysates from cells transfected with SEL-10-myc or cotransfected with 
SEL-10-myc and PS1, but was detected only as a complex with PS1 in cells cotransfected 
with the SEL- 10-myc and PS1 constructs. Figure 3B demonstrates that the SEL- 10- 
myc/PSl complex could be detected only when proteasome degradation was inhibited with 
lactacystin. Figure 3C demonstrates that immunoprecipitation of the SEL-10-myc/PSl 

20 complex with anti-myc antibody indicates that SEL-10-myc associates primarily with full 
length and high molecular weight forms of PS1. Immunoprecipitates were probed with 
either anti-PS 1 loop or anti-myc antibodies. Longer exposures of the ECL developed 
Western blot reveal very low levels of PS1-NTF in the immunoprecipitate with SEL- 10- 
myc, but fail to detect PS1-CTF. 
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Figures 4A and 4B. Figures 4 A and 4B are western blots. Figure 4 A demonstrates that 
SEL-10-myc stimulates ubiquitination of PS1. An HEK293 cell line with stable expression 
of PS1 was transfected with SEL-10-myc. Immunoprecipitation with an anti-PS 1 loop 
antibody was followed by detection with an anti-ubiquitin antibody. Figure 4B demonstrates 
5 that increased expression of SEL-10-myc leads to a decrease in PS1-NTF and PS1-CTF 
with a corresponding increase in the amount of full length PS1. Cell lysates were 
immunoprecipitated with anti-PS 1 loop antibody and then probed on the Western blot with 
the same antibody to detect PS1 and its processing products. 

Figures 5A, 5B and 5C. Figures 5A, 5B and 5C are graphs. Figure 5A demonstrates that 
10 transient co-expression of SEL-10-myc with APP increases production of AJ31-40 and Afil- 
42. The effect is additive with co-expressed PS1. Figure 5B demonstrates that stable 
expression of SEL-10-myc or PS1 increases endogenous A|3 production. Figure 5C 
demonstrates that transient expression of APP in the stable cell lines increases exogenous 
A(5 production over the level seen in a cell line transformed with the pcDNA3.1 vector 
15 control 

DETAILED DESCRIPTION OF THE INVENTION 

The present invention provides isolated nucleic acid molecules comprising a 
polynucleotide encoding human sel-10. The nucleotide sequence of human hippocampal 

20 sel-10 (hhsel-10), which sequence is given in SEQ ID NO:l, encodes five hhsel-10 
polypeptides (hhsel-lO-(l), hhsel-10-(2), hhsel-10-(3), hhsel-10-(4), and hhsel-10-(5), 
referred to collectively herein as hhsel-10). The nucleotide sequence of human mammary 
sel-10 (hmsel-10), which sequence is given in SEQ ID NO:2, encodes three hmsel-10 
polypeptides (hmSel-lO-(l), hmSel-10-(2), and hmsel-10-(3), referred to collectively herein 

25 as hmsel-10). The nucleotide sequences of the hhsel-10 polynucleotides are given in SEQ 
ID NO. 1, where nucleotide residues 45-1928 of SEQ ID NO. 1 correspond to hhsel-lO-(l), 
nucleotide residues 150-1928 of SEQ ID NO. 1 correspond to hhSel-10-(2), nucleotide 
residues 267-1928 of SEQ ID NO. 1 correspond to hhSel-10-(3), nucleotide residues 291- 
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1928 of SEQ ID NO. 1 correspond to hhSel-10-(4), and nucleotide residues 306-1928 of 
SEQ ID NO. 1 correspond to hhSel-10-(5). The nucleotide sequences of the hmSel-10 
polynucleotides are given in SEQ ID NO. 2, where nucleotide residues 180-1949 of SEQ ID 
NO. 2 correspond to hmSel-lO-(l), nucleotide residues 270-1949 of SEQ ID NO. 2 
5 correspond to hmSel-10-(2), and nucleotide residues 327-1949 of SEQ ID NO. 2 
correspond to hmSel-10-(3). The amino acid sequences of the polypeptides encoded by the 
hhSel-10 and hm-Sel-10 nucleic acid molecules are given as follows: SEQ ID NOS: 3, 4, 5, 
6, and 7 correspond to the hhSel-10-Q), hhSel-10-(2), nhSel-10-(3). hhSel-10-(4), and 
hhSel-10-(5) polypeptides, respectively, and SEQ ID NOS: 8, 9, and 10 correspond to the 

10 hmSel-lO-(l), hmSel-10-(2), and hmSel-10-(3) polypeptides, respectively. Unless otherwise 
noted, any reference herein to sel-10 will be understood to refer to human sel-10, and to 
encompass all of the hippocampal and mammary sel-10 nucleic acid molecules (in the case of 
reference to sel-10 nucleic acid, polynucleotide, DNA, RNA, or gene) or polypeptides (in 
the case of reference to sel-10 protein, polypeptide, amino acid sequnce). Fragments of 

15 hippocampal sel-10 and mammary sel-10 nucleic acid molecules and polypeptides are also 
provided. 

The nucleotide sequence of SEQ ID NO:l was obtained as described in Example 1, 
and is contained in cDNA clone PNV 102-1, which was deposited on November 9, 1998, at 
the American Type Culture Collection, 10801 University Blvd., Manassas, VA 20110, and 

20 given accession number 98978. The nucleotide sequence of SEQ ID NO:2 was obtained as 
described in Example 1, and is contained in cDNA clone PNV 108-2, which was deposited 
on November 9, 1998, at the American Type Culture Collection, 10801 University Blvd., 
Manassas, VA 201 10, and given accession number 98979. 

The human sel-10 polypeptides of the invention share homology with C. elegans sel- 

25 10, as well as with members of the p-transducin protein family, including yeast CDC4, and 
human LIS-1. This family is characterized by the presence of an F-box and multiple WD-40 
repeats (Li, J., et at, Proc. Natl. Acad. Sci. U.S.A. 92:12180-12184 (1995)). The repeats 
are 20-40 amino acids long and are bounded by gly-his (GH) and trp-asp (WD) residues. 
The three dimensional structure of P-transducin indicates that the WD40 repeats form the 

30 arms of a seven-bladed propeller like structure (Sondek, J., et at, Nature 579:369-374 
(1996)). Each blade is formed by four alternating pleats of beta-sheet with a pair of the 
conserved aspartic acid residues in the protein motif forming the limits of one internal beta 
strand. WD40 repeats are found in over 27 different proteins which represent diverse 
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functional classes (Neer, E.J., et al, Nature 371:291-300 (1994)). These regulate cellular 
functions including cell division, cell fate determination, gene transcription, signal 
transduction, protein degradation, mRNA modification and vesicle fusion. This diversity in 
function has led to the hypothesis that [3-transducin family members provide a common 
5 scaffolding upon which multiprotein complexes can be assembled. 

The nucleotide sequence given in SEQ ID NO:l corresponds to the nucleotide 
sequence encoding hhsel-10, while the nucleotide sequence given in SEQ ID NO:2 
corresponds to the nucleotide sequence encoding hmsel-10. The isolation and sequencing of 
DNA encoding sel-10 is described below in Examples 1 and 2. 

10 As is described in Examples 1 and 2, automated sequencing methods were used to 

obtain the nucleotide sequence of sel-10. The sel-10 nucleotide sequences of the present 
invention were obtained for both DNA strands, and are believed to be 100% accurate. 
However, as is known in the art, nucleotide sequence obtained by such automated methods 
may contain some errors. Nucleotide sequences determined by automation are typically at 

15 least about 90%, more typically at least about 95% to at least about 99.9% identical to the 
actual nucleotide sequence of a given nucleic acid molecule. The actual sequence may be 
more precisely determined using manual sequencing methods, which are well known in the 
art. An error in sequence which results in an insertion or deletion of one or more 
nucleotides may result in a frame shift in translation such that the predicted amino acid 

20 sequence will differ from that which would be predicted from the actual nucleotide sequence 
of the nucleic acid molecule, starting at the point of the mutation. The sel-10 DNA of the 
present invention includes cDNA, chemically synthesized DNA, DNA isolated by PCR, 
genomic DNA, and combinations thereof. Genomic sel-10 DNA may be obtained by 
screening a genomic library with the sel-10 cDNA described herein, using methods that are 

25 well known in the art. RNA transcribed from sel-10 DNA is also encompassed by the 
present invention. 

Due to the degeneracy of the genetic code, two DNA sequences may differ and yet 
encode identical amino acid sequences. The present invention thus provides isolated nucleic 
acid molecules having a polynucleotide sequence encoding any of the sel-10 polypeptides of 
30 the invention, wherein said polynucleotide sequence encodes a sel-10 polypeptide having the 
complete amino acid sequence of SEQ ID NOs:3-10, or fragments thereof. 

Also provided herein are purified sel-10 polypeptides, both recombinant and non- 
recombinant. Variants and derivatives of native sel-10 proteins that retain any of the 
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biological activities of sel-10 are also within the scope of the present invention. As is 
described above, the sel-10 polypeptides of the present invention share homology with yeast 
CDC4. As CDC4 is known to catalyze ubiquitination of specific cellular proteins (Feldman 
et al, Cell 97:221 (1997)), it may be inferred that sel-10 will also have this activity. Assay 
5 procedures for demonstrating such activity are well known, and involve reconstitution of the 
ubiquitinating system using purified human sel-10 protein together with the yeast proteins 
Cdc4p, Cdc53p and Skplp, or their human orthologs, and an El enzyme, the E2 enzyme 
Cdc34p or its human ortholog, ubiquitin, a target protein and an ATP regenerating system 
(Feldman et al, 1997). Skplp associates with Cdc4p through a protein domain called an F- 

10 box (Bai et al, Cell 86:263 (1996)). The F-box protein motif is found in yeast CDC4, 
C. elegans sel-10, mouse sel-10 and human sel-10. The sel-10 ubiquitination system may be 
reconstituted with the C. elegans counterparts of the yeast components, e.g., cul-1 (also 
known as lin-19) protein substituting for Cdc53p (Kipreos et al, Cell S5:829 (1996)) and 
the protein F46A9 substituting for Skplp, or with their mammalian counterparts, e.g., Cul-2 

15 protein substituting for Cdc53p (Kipreos et al, 1996) and mammalian Skplp substituting 
for yeast Skplp. A phosphorylation system provided by a protein kinase is also included in 
the assay system as per Feldman et al, 1997. 

Sel-10 variants may be obtained by mutation of native sel-10-encoding nucleotide 
sequences, for example. A sel-10 variant, as referred to herein, is a polypeptide substantially 

20 homologous to a native sel-10 but which has an amino acid sequence different from that of 
native sel-10 because of one or more deletions, insertions, or substitutions in the amino acid 
sequence. The variant amino acid or nucleotide sequence is preferably at least about 80% 
identical, more preferably at least about 90% identical, and most preferably at least about 
95% identical, to a native sel-10 sequence. Thus, a variant nucleotide sequence which 

25 contains, for example, 5 point mutations for every one hundred nucleotides, as compared to 
a native sel-10 gene, will be 95% identical to the native protein. The percentage of sequence 
identity, also termed homology, between a native and a variant sel-10 sequence may also be 
determined, for example, by comparing the two sequences using any of the computer 
programs commonly employed for this purpose, such as the Gap program (Wisconsin 

30 Sequence Analysis Package, Version 8 for Unix, Genetics Computer Group, University 
Research Park, Madison Wisconsin), which uses the algorithm of Smith and Waterman 
{Adv. Appl. Math. 2: 482-489 (1981)). 

Alterations of the native amino acid sequence may be accomplished by any of a 
number of known techniques. For example, mutations may be introduced at particular 



6142.N2CP £ 



locations by procedures well known to the skilled artisan, such as oligonucleotide- directed 
mutagenesis, which is described by Walder et al. (Gene 42:133 (1986)); Bauer et al. (Gene 
37:73 (1985)); Craik (BioTechniques, January 1985, pp. 12-19); Smith et al. (Genetic 
Engineering: Principles and Methods, Plenum Press (1981)); and U.S. Patent Nos. 
5 4,518,584 and 4,737,462. 

Sel-10 variants within the scope of the invention may comprise conservatively 
substituted sequences, meaning that one or more amino acid residues of a sel-10 polypeptide 
are replaced by different residues that do not alter the secondary and/or tertiary structure of 
the sel-10 polypeptide. Such substitutions may include the replacement of an amino acid by 

10 a residue having similar physicochemical properties, such as substituting one aliphatic 
residue (He, Val, Leu or Ala) for another, or substitution between basic residues Lys and 
Arg, acidic residues Glu and Asp, amide residues Gin and Asn, hydroxyl residues Ser and 
Tyr, or aromatic residues Phe and Tyr. Further information regarding making 
pheno typically silent amino acid exchanges may be found in Bowie et al., Science 247:1306- 

15 1310 (1990). Other sel-10 variants which might retain substantially the biological activities 
of sel-10 are those where amino acid substitutions have been made in areas outside 
functional regions of the protein. 

In another aspect, the invention provides an isolated nucleic acid molecule 
comprising a polynucleotide which hybridizes under stringent conditions to a portion of the 

20 nucleic acid molecules described above, e.g., to at least about 15 nucleotides, preferably to 
at least about 20 nucleotides, more preferably to at least about 30 nucleotides, and still more 
preferably to at least about from 30 to at least about 100 nucleotides, of one of the 
previously described nucleic acid molecules. Such portions of nucleic acid molecules having 
the described lengths refer to, e.g., at least about 15 contiguous nucleotides of the reference 

25 nucleic acid molecule. By stringent hybridization conditions is intended overnight incubation 
at about 42/C for about 2.5 hours in 6 X SSC/0.1% SDS, followed by washing of the filters 
in 1.0 X SSC at 65/C, 0.1% SDS. 

Fragments of the sel-10-encoding nucleic acid molecules described herein, as well as 
polynucleotides capable of hybridizing to such nucleic acid molecules may be used as a 

30 probe or as primers in a polymerase chain reaction (PCR). Such probes may be used, e.g., 
to detect the presence of sel-10 nucleic acids in in vitro assays, as well as in Southern and 
northern blots. Cell types expressing sel-10 may also be identified by the use of such probes. 
Such procedures are well known, and the skilled artisan will be able to choose a probe of a 
length suitable to the particular application. For PCR, 5' and 3' primers corresponding to the 
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termini of a desired sel-10 nucleic acid molecule are employed to isolate and amplify that 
sequence using conventional techniques. 

Other useful fragments of the sel-10 nucleic acid molecules are antisense or sense 
oligonucleotides comprising a single- stranded nucleic acid sequence capable of binding to a 
5 target sel-10 mRNA (using a sense strand), or sel-10 DNA (using an antisense strand) 
sequence. 

In another aspect, the invention includes sel-10 polypeptides with or without 
associated native pattern glycosylation. Sel-10 expressed in yeast or mammalian expression 
systems (discussed below) may be similar to or significantly different from a native sel-10 

10 polypeptide in molecular weight and glycosylation pattern. Expression of sel-10 in bacterial 
expression systems will provide non- glycosylated sel-10. 

The polypeptides of the present invention are preferably provided in an isolated form, 
and preferably are substantially purified. Sel-10 polypeptides may be recovered and purified 
from recombinant cell cultures by well-known methods, including ammonium sulfate or 

15 ethanol precipitation, anion or cation exchange chromatography, phosphocellulose 
chromatography, hydrophobic interaction chromatography, affinity chromatography, 
hydroxylapatite chromatography and lectin chromatography. In a preferred embodiment, 
high performance liquid chromatography (HPLC) is employed for purification. 

The present invention also relates to vectors comprising the polynucleotide 

20 molecules of the invention, as well as host cell transformed with such vectors. Any of the 
polynucleotide molecules of the invention may be joined to a vector, which generally 
includes a selectable marker and an origin of replication, for propagation in a host. Because 
the invention also provides sel-10 polypeptides expressed from the polynucleotide molecules 
described above, vectors for the expression of sel-10 are preferred. The vectors include 

25 DNA encoding any of the sel-10 polypeptides described above or below, operably linked to 
suitable transcriptional or translational regulatory sequences, such as those derived from a 
mammalian, microbial, viral, or insect gene. Examples of regulatory sequences include 
transcriptional promoters, operators, or enhancers, mRNA ribosomal binding sites, and 
appropriate sequences which control transcription and translation. Nucleotide sequences are 

30 operably linked when the regulatory sequence functionally relates to the DNA encoding sel- 
10. Thus, a promoter nucleotide sequence is operably linked to a sel-10 DNA sequence if 
the promoter nucleotide sequence directs the transcription of the sel-10 sequence. 

Selection of suitable vectors to be used for the cloning of polynucleotide molecules 
encoding sel-10, or for the expression of sel-10 polypeptides, will of course depend upon the 
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host cell in which the vector will be transformed, and, where applicable, the host cell from 
which the sel-10 polypeptide is to be expressed. Suitable host cells for expression of sel-10 
polypeptides include prokaryotes, yeast, and higher eukaryotic cells, each of which is 
discussed below. 

5 The sel-10 polypeptides to be expressed in such host cells may also be fusion 

proteins which include regions from heterologous proteins. Such regions may be included to 
allow, e.g., secretion, improved stability, or facilitated purification of the polypeptide. For 
example, a sequence encoding an appropriate signal peptide can be incorporated into 
expression vectors. A DNA sequence for a signal peptide (secretory leader) may be fused 

10 in-frame to the sel-10 sequence so that sel-10 is translated as a fusion protein comprising the 
signal peptide. A signal peptide that is functional in the intended host cell promotes 
extracellular secretion of the sel-10 polypeptide. Preferably, the signal sequence will be 
cleaved from the sel-10 polypeptide upon secretion of sel-10 from the cell. Non-limiting 
examples of signal sequences that can be used in practicing the invention include the yeast 

15 I-factor and the honeybee melatin leader in sf9 insect cells. 

In a preferred embodiment, the sel-10 polypeptide will be a fusion protein which 
includes a heterologous region used to facilitate purification of the polypeptide. Many of the 
available peptides used for such a function allow selective binding of the fusion protein to a 
binding partner. For example, the sel-10 polypeptide may be modified to comprise a peptide 

20 to form a fusion protein which specifically binds to a binding partner, or peptide tag. 
Non-limiting examples of such peptide tags include the 6-His tag, thioredoxin tag, FLAG 
tag, hemaglutinin tag, GST tag, and OmpA signal sequence tag. As will be understood by 
one of skill in the art, the binding partner which recognizes and binds to the peptide may be 
any molecule or compound including metal ions (e.g., metal affinity columns), antibodies, or 

25 fragments thereof, and any protein or peptide which binds the peptide. These tags may be 
recognized by fluorescein or rhodamine labeled antibodies that react specifically with each 
type of tag 

Suitable host cells for expression of sel-10 polypeptides include prokaryotes, yeast, 
and higher eukaryotic cells. Suitable prokaryotic hosts to be used for the expression of sel- 
30 10 include bacteria of the genera Escherichia, Bacillus, and Salmonella, as well as members 
of the genera Pseudomonas, Streptomyces, and Staphylococcus. For expression in, e.g., E. 
coli, a sel-10 polypeptide may include an N-terminal methionine residue to facilitate 
expression of the recombinant polypeptide in a prokaryotic host. The N-terminal Met may 
optionally then be cleaved from the expressed sel-10 polypeptide. 




Expression vectors for use in prokaryotic hosts generally comprise one or more 
phenotypic selectable marker genes. Such genes generally encode, e.g., a protein that 
confers antibiotic resistance or that supplies an auxotrophic requirement. A wide variety of 
such vectors are readily available from commercial sources. Examples include pSPORT 

5 vectors, pGEM vectors (Promega), pPROEX vectors (LTI, Bethesda, MD), Bluescript 
vectors (Stratagene), and pQE vectors (Qiagen). 

Sel-10 may also be expressed in yeast host cells from genera including 
Saccharomyces, Pichia, and Kluveromyces. Preferred yeast hosts are S. cerevisiae and P. 
pastoris. Yeast vectors will often contain an origin of replication sequence from a 2T yeast 

10 plasmid, an autonomously replicating sequence (ARS), a promoter region, sequences for 
polyadenylation, sequences for transcription termination, and a selectable marker gene. 

Vectors replicable in both yeast and E. coli (termed shuttle vectors) may also be 
used. In addition to the above-mentioned features of yeast vectors, a shuttle vector will also 
include sequences for replication and selection in E. coli. Direct secretion of sel-10 

15 polypeptides expressed in yeast hosts may be accomplished by the inclusion of nucleotide 
sequence encoding the yeast I-factor leader sequence at the 5' end of the sel-10-encoding 
nucleotide sequence. 

Insect host cell culture systems may also be used for the expression of Sel-10 
polypeptides. In a preferred embodiment, the sel-10 polypeptides of the invention are 
20 expressed using a baculovirus expression system. Further information regarding the use of 
baculo virus systems for the expression of heterologous proteins in insect cells are reviewed 
by Luckow and Summers, Bio/Technology 6:41 (1988). 

In another preferred embodiment, the sel-10 polypeptide is expressed in mammalian 
host cells. Non-limiting examples of suitable mammalian cell lines include the COS-7 line of 
25 monkey kidney cells (Gluzman et al, Cell 23:175 (1981)) and Chinese hamster ovary 
(CHO) cells. 

The choice of a suitable expression vector for expression of the sel-10 polypeptides 
of the invention will of course depend upon the specific mammalian host cell to be used, and 
is within the skill of the ordinary artisan. Examples of suitable expression vectors include 
30 pcDNA3 (Invitrogen) and pSVL (Pharmacia Biotech). Expression vectors for use in 
mammalian host cells may include transcriptional and translational control sequences derived 
from viral genomes. Commonly used promoter sequences and enhancer sequences which 
may be used in the present invention include, but are not limited to, those derived from 
human cytomegalovirus (CMV), Adenovirus 2, Polyoma virus, and Simian virus 40 (SV40). 
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Methods for the construction of mammalian expression vectors are disclosed, for example, 
in Okayama and Berg (Mol. Cell. Biol. 5:280 (1983)); Cosman et al. (Mol. Immunol. 
23:935 (1986)); Cosman et al. {Nature 312:16$ (1984)); EP-A-0367566; and WO 
91/18982. 

5 The polypeptides of the present invention may also be used to raise polyclonal and 

monoclonal antibodies, which are useful in diagnostic assays for detecting sel-10 polypeptide 
expression. Such antibodies may be prepared by conventional techniques. See, for example, 
Antibodies: A Laboratory Manual, Harlow and Land (eds.), Cold Spring Harbor Laboratory 
Press, Cold Spring Harbor, N.Y., (1988); Monoclonal Antibodies, Hybridomas: A New 

10 Dimension in Biological Analyses, Kennet et al. (eds.), Plenum Press, New York (1980). 

The sel-10 nucleic acid molecules of the present invention are also valuable for 
chromosome identification, as they can hybridize with a specific location on a human 
chromosome. There is a current need for identifying particular sites on the chromosome, as 
few chromosome marking reagents based on actual sequence data (repeat polymorphisms) 

15 are presently available for marking chromosomal location. Once a sequence has been 
mapped to a precise chromosomal location, the physical position of the sequence on the 
chromosome can be correlated with genetic map data. The relationship between genes and 
diseases that have been mapped to the same chromosomal region can then be identified 
through linkage analysis, wherein the coinheritance of physically adjacent genes is 

20 determined. Whether a gene appearing to be related to a particular disease is in fact the 
cause of the disease can then be determined by comparing the nucleic acid sequence between 
affected and unaffected individuals. 

The sel-10 polypeptides of the invention, and the DNA encoding them, may also be 
used to further elucidate the biological mechanism of AD, and may ultimately lead to the 

25 identification of compounds that can be used to alter such mechanisms. The sel-10 
polypeptides of the invention are 47.6% identical and 56.7% similar to C. elegans sel-10. 
As is described above, mutations to C. elegans sel-10 are known to suppress mutations to 
sel-12 that result in a loss-of-function for egg laying, and also to suppress certain 
hypomorphic mutations to lin-Yl. Mutations to C. elegans sel-12 can also be rescued by 

30 either of the human AD-linked genes PS-1 (42.7% identical to sel-\2) or PS-2 (43.4% 
identical to sel-12). However, human PS-1 with a familial AD-linked mutant has a reduced 
ability to rescue sel-12 mutants (Levitan, D. et al., Proc. Natl Acad. Sci. USA 93: 14940- 
14944 (1996)). 
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This demonstrated interchangeability of human and C. elegans genes in the notch 
signaling pathway makes it reasonable to predict that mutations of human sel-10 will 
suppress mutations to PS-1 or PS-2 that lead to AD, especially in light of the predicted 
structure of sel-10. As described above, PS-1 and PS-2 mutations that lead to AD are those 

5 which interfere with the proteolytic processing of PS-1 or PS-2. The sel-10 polypeptides of 
the invention are members of the pVtransducin protein family, which includes yeast CDC4, a 
component of an enzyme which functions in the ubiquitin-dependent protein degradation 
pathway. Thus, human sel-10 may regulate presenilin degradation via the ubiquitin- 
proteasome pathway. Alternatively, or in addition, human sel-10 may alter presenilin 

10 function by targeting for degradation through ubiquitination a modulator of presenilin 
activity, e.g., a negative regulator. Therefore, mutations to sel-10 may reverse the faulty 
proteolytic processing of PS-1 or PS-2 which occurs as a result of mutation to PS-1 or PS-2 
or otherwise increase presenilin function. For the same reason, inhibition of sel-10 activity 
may also act to reverse PS-1 or PS-2 mutations. Thus, it may be hypothesized that 

15 compounds which inhibit either the expression or the activity of the human sel-10 
polypeptides of the invention may reverse the effects of mutations to PS-1 or PS-2, and thus 
be useful for the prevention or treatment of AD. 

Thus, C. elegans may be used as a genetic system for the identification of agents 
capable of inhibiting the activity or expression of the human sel-10 polypeptides of the 

20 invention. A suitable C. elegans strain for use in such assays lacks a gene encoding active C. 
elegans sel-10, and exhibits a loss-of-function for egg-laying resulting from an inactivated 
sel-12 gene. Construction of C. elegans strains having a loss-of-function for egg-laying due 
to mutation of sel-12 may be accomplished using routine methods, as both the sequence of 
sel-12 (Genebank accession number U35660) and mutations to sel-12 resulting in a loss-of- 

25 function for egg laying are known (see Levitan et al, Nature 377: 351-354 (1995), which 
describes construction of C. elegans sel-12(arl71)). An example of how to make such a 
strain is also given in Levitan et al. (Nature 377: 351-354 (1995)). Wild-type C. elegans 
sel-10 in the C. elegans sel-12(arl71)) , is also mutagenized using routine methods, such 
as the technique used for sel-12 mutagenesis in Levitan et al, supra. 

30 In order to identify compounds inhibiting human sel-10 activity, a DNA vector 

containing a human sel-10 gene encoding any of the wild-type human sel-10 proteins of the 
invention is introduced into the above- described C. elegans strain. In a preferred 
embodiment, the heterologous human sel-10 gene is integrated into the C. elegans genome. 
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The gene is then expressed, using techniques described in Levitan et al. (Proc. Natl. Acad. 
Sci. USA 93: 14940-14944 (1996)). Test compounds are then administered to this strain in 
order to determine whether a given agent is capable of inhibiting sel-10 activity so as to 
suppress mutations to sel-12 or lin-12 that result in egg-laying defects. Egg-laying in this 

5 strain is then determined, e.g. by the assay described in Levitan et al. {Proc. Natl. Acad. Sci. 
USA 93: 14940-14944 (1996)). To confirm that the compound's effect on egg-laying is due 
to inhibition of sel-10 activity, the action of the compound can be tested in a second 
biochemical or genetic pathway that is known to be affected by loss-of-function mutations in 
sel-10 {e.g., further elevation of lin-12 activity in lin- 12(d) hypomorphic strains). Such 

10 assays may be performed as described in Sundarem and Greenwald {Genetics 135: 765-783 
(1993)). 

Alternatively, compounds are tested for their ability to inhibit the E3 Ubiquitin 
Ligating Enzyme. Assays procedures for demonstrating such activity are well known, and 
involve reconstitution of the ubiquitinating system using purified human sel-10 protein 

15 together with the yeast proteins Cdc4p, Cdc53p and Skplp and an El enzyme, the E2 
enzyme Cdc34p, ubiquitin, a target protein and an ATP regenerating system (Feldman et al, 
1997). The sel-10 ubiquitination system may also be reconstituted with the C. elegans 
counterparts of the yeast components, e.g., cul-1 (also known as lin- 19) protein substituting 
for Cdc53p (Kipreos et al, Cell 85:829 (1996)) and the protein F46A9 substituting for 

20 Skplp, or with their mammalian counterparts, e.g., Cul-2 protein substituting for Cdc53p 
(Kipreos et al, ibid.) and mammalian Skplp substituting for yeast Skplp. A 
phosphorylation system provided by a protein kinase is also to be included in the assay 
system as per Feldman et al, 1997. 

Alternatively, cell lines which express human sel-10 due to transformation with a 

25 human sel-10 cDNA and which as a consequence have elevated APP processing and 
formation of Ap^o or Af^-42 may also be used for such assays as in Example 3. Compounds 
may be tested for their ability to reduce the elevated Afi processing seen in the sel-10 
transformed cell line. 

Compounds that rescue the egg-laying defect or that inhibit E3 Ubiquitin Ligating 
30 Enzyme are then screened for their ability to cause a reduction in the production of A-beta!. 
40 or A-betai_42 in a human cell line. Test compounds are used to expose IMR-32 or other 
human cell lines known to produce A-betai_4o or A-betai-42 (Asami-Okada et al, 
Biochemistry 34: 10272-10278 (1995)), or in human cell lines engineered to express human 
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APP at high levels. In these assays, A-beta^o or A-betai_4 2 is measured in cell extracts or 
after release into the medium by ELISA or other assays which are known in the art 
(Borchelt et al, Neuron 17: 1005-1013 (1996); Citron et al, Nat. Med. 3: 67-72 (1997)). 

5 Having generally described the invention, the same will be more readily 

understood by reference to the following examples, which are provided by way of 
illustration and are not intended as limiting. 

EXAMPLES 

10 

Example 1: Identification of a human homologue to C. elegans sel-10 
Results 

Identification of sel-10 in ACEDB: Sel-10 maps between the cloned polymorphisms arP3 
and TCPARI just to the left of him-5 [ACEDB entry wm95p536]. Three phage lambda 

15 clones have been sequenced across the interval, F53C11, F09F3, and F55B12. Sel-10 is 
reported to have homology to yeast cdc4 [ACEDB entry wm97ab259]. Blast search 
revealed a single ORF with homology to yeast cdc4 (CC4_YST) within the interval defined 
by arP3 and TCPARI corresponding to the GenPep entry F55B12.3. F55B12.3, like yeast 
cdc4, is a member of the [3-transducin protein family. This family is characterized by the 

20 presence of multiple WD40 repeats [Neer, E.J. et al, Nature 371: 297-300 (1994)]. 

Identification of a human sel-10 homologue, Incyte 028971: The GenPep entry F55B12.3 
was used to search the LifeSeq, LifeSeq FL and EMBL data bases using tblastn. The 
search revealed multiple homologies to (3-transducin family members including LIS-1 

25 (S36113 and P43035), a gene implicated in Miller-Dieker lissencephaly, a Xenopus laevis 
gene, TRCPXEN (U63921), and a human contig in LifeSeq FL, 028971. Since there also 
are multiple [3-transducin family members within the C. elegans genome, these were 
collected using multiple blast searches and then clustered with the sel-10 candidate genes. 
Multiple alignments were performed with the DNAStar program Megalign using the 

30 Clustal method. This revealed that LIS-1 clustered with T03F6.F, a different (3-transducin 
family member and thus excluded it as a candidate sel-10 homologue. TRCPXEN 
clustered with K10B2.1, a gene which also clusters with F55B12.3 and CC4YST, while 
Incyte 028971 clustered with sel-10. Thus, Incyte 028971 appears to encode the human 
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homologue of C. elegans sel-10. Sequence homology between sel-10 and 028971 is 
strongest in the region of the protein containing 7 repeats of the WD40 motif. The Incyte 
028971 contig contains 44 ESTs from multiple libraries including pancreas, lung, T- 
lymphocytes, fibroblasts, breast, hippocampus, cardiac muscle, colon, and others. 

5 

Public EST: Blastx searches with the DNA sequence 028971 against the TREMBLP 
dataset identified a single homologous mouse EST (W85144) from the IMAGE Library, 
Soares mouse embryo NbME13.5-14.5. The blastx alignment of 028971 with W85144 and 
then with F55B12.3 revealed a change in reading frame in 028971 which probably is due to 
10 a sequencing error. 

Blastn searches of the EMBL EST database with the 028971 DNA sequence 
revealed in addition to W85144, three human EST that align with the coding sequence of 
028971 and six EST that align with the 3' untranslated region of the 028971 sequence. 

15 Protein Motifs: Two protein motifs were identified in F55B12.3 which are shared with 
yeast cdc4, mouse w85144 and human 028971. These are an F-box in the N-terminal 
domain and seven P-transducin repeats in the C-terminal domain. 

Discussion 

20 The sel-10 gene encodes a member of the (3-transducin protein family. These are 

characterized by the presence of multiple WD40 repeats [Neer, E.J. et al, Nature 371: 
297-300 (1994)]. The repeats are 20-40 amino acids long and are bounded by gly-his 
(GH) and trp-asp (WD) residues. Solution of the three dimensional structure of (5- 
transducin indicates that the WD40 repeats form the arms of a seven-bladed propeller like 

25 structure [Sondek, J. et al, Nature 379: 369-74 (1996)]. Each blade is formed by four 
alternating pleats of beta-sheet with a pair of the conserved aspartic acid residues in the 
protein motif forming the limits of one internal beta strand. WD40 repeats are found in 
over 27 different proteins which represent diverse functional classes [Neer, E.J. et al, 
Nature 371: 297-300 (1994)]. These regulate cellular functions including cell division, cell 

30 fate determination, gene transcription, signal transduction, protein degradation, mRNA 
modification and vesicle fusion. This diversity in function has led to the hypothesis that (3- 
transducin family members provide a common scaffolding upon which multiprotein 
complexes can be assembled. 
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The homology of sel-10, 28971 and W85144 to the yeast cdc4 gene suggests a 
functional role in the ubiquitin-proteasome pathway for intracellular degradation of protein. 
Mutations of the yeast cdc4 gene cause cell cycle arrest by blocking degradation of Sic 1, an 
inhibitor of S-phase cyclin/cdk complexes [King, R.W. et al, Science 274: 1652-9 (1996)]. 

5 Phosphorylation of Sicl targets it for destruction through the ubiquitin-proteasome 
pathway. This pathway consists of three linked enzyme reactions that are catalyzed by 
multiprotein complexes [Ciechanover, A., Cell 79: 13-21 (1994); Ciechanover, A. and A.L. 
Schwartz, FASEB J. 8: 182-91 (1994)]. Initially, the C-terminal glycine of ubiquitin is 
activated by ATP to form a high energy thiol ester intermediate in a reaction catalyzed by 

10 the ubiquitin- activating enzyme, El. Following activation, an E2 enzyme (ubiquitin 
conjugating enzyme) transfers ubiquitin from El to the protein target. In some cases, E2 
acts alone. In others, it acts in concert with an E3 ubiquitin-ligating enzyme which binds 
the protein substrate and recruits an E2 to catalyze ubiquitination. E2 ubiquitin- 
conjugating enzymes comprise a fairly conserved gene family, while E3 enzymes are 

15 divergent in sequence [Ciechanover, A., Cell 79: 13-21 (1994); Ciechanover, A. and A.L. 
Schwartz, FASEB J. 8: 182-91 (1994)]. 

In yeast, mutation of the E2 ubiquitin-conjugating enzyme, cdc34, causes cell 
cycle arrest through failure to degrade the Sicl inhibitor of the S-phase cyclin/cdk complex 
[King, R.W. et al, Science 274: 1652-9 (1996)]. Sicl normally is degraded as cells enter 

20 the Gl-S phase transition, but in the absence of cdc34, Sicl escapes degradation and its 
accumulation causes cell cycle arrest. Besides cdc34, cdc4 is one of three other proteins 
required for the Gl-S phase transition. The other two are cdc53 and Skpl. As discussed 
above, cdc4 contains two structural motifs, seven WD40 repeats (which suggests that the 
protein forms a beta-propeller) and a structural motif shared with cyclin F which is an 

25 interaction domain for Skpl [Bai, C. et al, Cell 86: 263-7 r 4 (1996)]. Insect cell lysates 
containing cdc53, cdc4 and skpl (and also ubiquitin, cdc34 and El) can transfer ubiquitin 
to Sicl suggesting that one or more of these components functions as an E3 ubiquitin- 
ligating enzyme [King, R.W. et al, Science 274: 1652-9 (1996)]. Increased expression of 
either cdc4 or Skpl partially rescues loss of the other. 

30 In C. elegans, mutation of sel-10 has no visible phenotype indicating that sel-10 

does not play a role in regulation of the cell-cycle. A closely related, C. elegans 
transducin family member, K10B2.6 may play that role as it clusters with the gene 
TRCP_XEN from Xenopus laevis which rescues yeast cell cycle mutants arrested in late 
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anaphase due to a failure to degrade cyclin B [Spevak,W. et al, Mol. Cell. Biol. 13: 4953- 
66 (1993)]. If sel-10 does encode a component of an E3-ubiquitin ligating enzyme, how 
might it suppress sel-12 and enhance lin-12 mutations? The simplest hypothesis is that sel- 
10 regulates degradation of both proteins via the ubiquitin-proteasome pathway. Both sel- 

5 12 and lin-12 are transmembrane proteins. Sel-12 crosses the membrane 8 times such that 
its N- and C-termini face the cytosol [Kim, T.W. et al, J. Biol. Chem. 272: 11006-10 
(1997)], while lin-12 is a type 1 transmembrane protein (Greenwald, I. and G. Seydoux, 
Nature 346: 197-9 (1990)). Both are ubiquitinated, and in the case of human PS2, steady 
state levels increase in cells treated with an inhibitor of the proteasome, N-acetyl-L- 

10 leucinal-L-norleucinal and lactacystin (Li, X. and I. Greenwald, Neuron. 17: 1015-21 
(1996)). Alternatively, sel-10 may target for degradation of a negative regulator of 
presenilin function. 

The genetic analysis and protein function suggested by homology to cdc4 implies 
that drug inhibitors of human sel-10 may increase steady state levels of human presenilins. 
15 This could potentiate activity of the presenihn pathway and provide a means for therapeutic 
intervention in Alzheimer's disease. 

Example 2: 5' RACE cloning of a human cDNA encoding Sel-10, an extragenic 
suppressor of presenilin mutations in C. elegans 

20 

Materials and Methods 

Oligonucleotide primers for the amplification of the sel-10 coding sequence from 
C. elegans cDNA were prepared based on the sequence of F55B12.3, identified in Example 
1 as the coding sequence for C. elegans sel-10. The primers prepared were: 

25 5 -CGGGATCC ACC ATGGATGATGG ATCGATGAC ACC-3 ' (SEQ ID NO: 11) and 
5-GGAATTCCTTAAGGGTATACAGCATCAAAGTCG-3' (SEQ ID NO: 12). C. 
elegans mRNA was converted to cDNA using a BRL Superscript II Preamplification kit. 
The PCR product was digested with restriction enzymes BamHl and EcoRl (LTI, 
Gaithersberg, MD) and cloned into pcDNA3.1 (Invitrogen). Two isolates were sequenced 

30 (ABI, Perkin-Elmer Corp). 

The sequence of Incyte clone 028971 (encoding a portion of the human 
homologue of C. elegans sel-10), was used to design four antisense oligonucleotide 
primers: 5 -TC ACTTC ATGTCC AC ATC AAAGTCC-3 ' (SEQ ID NO: 13), 5-GGTAA- 
TTACAAGTTCTTGTTGAACTG (SEQ ID NO: 14), 5 '-CCCTGC AACGTGTGT- 
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AGACAGG-3' (SEQ ID NO: 15), and 5-CCAGTCTCTGCATTCCACACTTTG-3' (SEQ 
ID NO: 16) to amplify the missing 5' end of human sel-10. The Incyte LifeSeq "Electronic 
Northern" analysis was used to identify tissues in which sel-10 was expressed. Two of 
these, hippocampus and mammary gland, were chosen for 5' RACE cloning using a 

5 CloneTech Marathon kit and prepared Marathon-ready cDNA from hippocampus and 
mammary gland. PCR products were cloned into the TA vector pCR3.1 (Invitrogen), and 
isolates were sequenced. An alternate 5' oligonucleotide primer was also designed based 
on Incyte clones which have 5' ends that differ from the hippocampal sel-10 sequence: 
5 '-CTC AGAC AGGTC AGGACATTTGG-3 ' (SEQ ID NO: 17 ). 

10 Blastn was used to search Incyte databases LifeSeq and LifeSeqFL. Gap 

alignments and translations were performed with GCG programs (Wisconsin Sequence 
Analysis Package, Version 8 for Unix, Genetics Computer Group, University Research 
Park, Madison Wisconsin). 

15 Results 

The coding sequence of the C. elegans sel-10: The predicted coding sequence of the C. 
elegans sel-10, F55B12.3, had originally been determined at the Genome Sequencing 
Center, Washington University, St. Louis, by using the computer program GeneFinder to 
predict introns and exons in the genomic cosmid F55B12. The hypothetical cDNA 
20 sequence was confirmed by amplifying this region from C. elegans cDNA, cloning, and 
sequencing it . 

The coding sequence of the human sel-10 gene homologue: All of the 028971 antisense 
oligonucleotides amplified a 5' RACE product from human hippocampal and mammary 

25 cDNA. The longest PCR product from the hippocampal reactions was cloned and 
sequenced. This PCR reaction was designed to generate products which end at the 
predicted stop codon. Two isolates contained identical sequence which begins 880 bases 
before the beginning of the 028971 sequence. This sequence was confirmed by comparison 
with spanning Incyte cDNA clones. The Incyte clones that spanned the 5' end of the 

30 human sel-10 homologue were not annotated as F55B12.3, as the homology in this region 
between the human and C. elegans genes is low, and as the overlap between these clones 
and the annotated clones happened to be too small for them to be clustered in the Incyte 
database or uncovered by our blasting the Incyte database with the 028971 sequence. 
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The predicted protein sequences of human sel-10 have 47.6% identity and 56.7% 
similarity to C. elegans sel-10. The N-tenninus of the human sel-10 sequence begins with 
4 in-frame methionines. In addition to the WD40 repeats described above, the human 
sequence also contains a region homologous to the CDC4 F-box for binding Skpl, as 
5 expected for a sel-10 homologue. 

Different human sel-10 mRNAs expressed in mammary and hippocampal tissues: 

Several additional human sel-10 ESTs which differ from the hippocampal sequence were 
identified. These are an exact match, which indicates that the alternative transcript is 

10 probably real. Comparison of these sequences with the human hippocampal sel-10 
sequence shows divergence prior to the 4th in-frame methionine and then exact sequence 
match thereafter. An oligonucleotide primer specific for the 5' end of this alternative 
transcript was found to amplify a product from mammary but not hippocampal cDNA. 
This indicates either that the human sel-10 transcript undergoes differential splicing in a 

15 tissue-specific fashion or that the gene contains multiple, tissue specific promoters. 

Discussion 

5 'RACE and PCR amplification were used to clone a full-length cDNA encoding the 
human homologue of the C elegans gene, sel-10. Sequence analysis confirms the earlier 

20 prediction that sel-10 is a member of the CDC4 family of proteins containing F-Box and 
WD40 Repeat domains. Two variants of the human sel-10 cDNA were cloned from 
hippocampus and mammary gland which differed in 5' sequence preceding the apparent site 
of translation initiation. This implies that the gene may have two or more start sites for 
transcription initiation which are tissue-specific or that the pattern of exon splicing is 

25 tissue- specific. 

EXAMPLE 3: Expression Of Epitope- Tagged Sel-10 In Human Cells , and 

PerturbationOf Amyloid Peptide Processing By Human Sel-10 

Materials And Methods 

Construction of Epitope-Tagged Sel-10: Subcloning, Cell Growth and Transfection: 
30 An EcoRl site was introduced in-frame into the human sel-10 cDNA using a 

polymerase chain reaction (PCR) primed with the oligonucleotides 237 (5'-GGAATTC- 
C ATGAAAAGATTGG ACC ATGGTTCTG-3 ' ) (SEQ ID NO: 18) and 206 (5'-GGA- 
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ATTCCTC ACTTC ATGT-C AC ATC AAAGTCC AG- 3 ' ) (SEQ ID NO: 19). The resulting 
PCR product was cloned into the EcoRl site of the vector pCS2+MT. This fused a 5' 6- 
myc epitope tag in- frame to the fifth methionine of the hippocampal sel-10 cDNA, i.e., 
upstream of nucleotide 306 of the sequence given in SEQ ID NO:l. The nucleotide 
5 sequence of this construct, designated 6myc-N- sel-10, is given in SEQ ID NO: 20, while 
the amino acid sequence of the polypeptide encoded thereby is given in SEQ ID NO: 21. 
The hippocampal and mammary sel-10 cDNA diverge upstream of this methionine. A PS1 
cDNA with a 3 '-FLAG tag (PS1-C-FLAG) was subcloned into the pcDNA3.1 vector. An 
APP cDNA containing the Swedish NL mutation and an attenuated ER retention sequence 

10 consisting of the addition of a di-lysyl motif to the C-terminus of APP695 (APP695NL-KK) 
was cloned into vector pIRES-EGFP (Mullan et al., Nat Genet 1992 Aug;l(5):345-7). 
HEK293 and IMR32 cells were grown to 80% confluence in DMEM with 10% FBS and 
transfected with the above cDNA. A total of 10 mg total DNA/6xl0 6 cells was used for 
transfection with a single plasmid. For cotransfections of multiple plasmids, an equal 

15 amount of each plasmid was used for a total of 10 mg DNA using LipofectAmine (BRL). 

In order to construct C-term V5 his tagged sel-10 and the C-term mychis tagged sel- 
10, the coding sequence of human hippocampal sel-10 was amplified using oligonucleotides 
primers containing a Kpnl restriction site on the 5' primer: 5-GGGTA- 
CCCCTC ATTATTCCCTCGAGTTCTTC-3 ' (SEQ ID NO:22) and an EcoRI site on the 3' 

20 primer: 5 -GGAATTCCTTCATGTCC ACATCAAAGTCC-3' (SEQ ID NO:23), using the 
original human sel-10 RACE per product as template. The product was digested with both 
Kpnl and EcoRI and cloned into either the vector pcDNA6/V5-His A or pcDNA3. 1/Myc- 
His(+) A (Invitrogen). The nucleotide sequence of independent isolates was confirmed by 
dideoxy sequencing. The nucleotide sequence of the C-term V5 his tagged sel-10 is given in 

25 SEQ ID NO: 24, while the amino acid sequence of the polypeptide encoded thereby is given 
in SEQ ID NO: 25. The nucleotide sequence of independent isolates was confirmed by 
dideoxy sequencing. The nucleotide sequence of the C-term mychis tagged sel-10 is given 
in SEQ ID NO: 26, while the amino acid sequence of the polypeptide encoded thereby is 
given in SEQ ID NO: 27. 

30 Clonal Selection of transformed cells by FACS: Cell samples were analyzed on 

an EPICS Elite ESP flow cytometer (Coulter, Hialeah, FL) equipped with a 488 nm 
excitation line supplied by an air-cooled argon laser. EGFP emission was measured through 
a 525 nm band-pass filter and fluorescence intensity was displayed on a 4-decade log scale 
after gating on viable cells as determined by forward and right angle light scatter. Single 
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green cells were separated into each well of one 96 well plate containing growth medium 
without G418. After a four day recovery period, G418 was added to the medium to a final 
concentration of 400 mg/ml. Wells with clones were expanded from the 96 well plate to a 24 
well plate and then to a 6 well plate with the fastest growing colonies chosen for expansion 
5 at each passage. 



transfection with 4% formaldehyde and 0.1% Triton X-100 in PBS for 30 min on ice and 
blocked with 10% Goat serum in PBS (blocking solution) 1 hr RT (i.e., 25°C), followed by 
incubation with mouse anti-myc (10 mg/ml) or rabbit anti-FLAG (0.5 mg/ml) antibody 4°C 

10 O/N and then fluorescein-labeled goat anti-mouse or anti-rabbit antibody (5mg/ml) in 
blocking solution 1 hr at 25°C. 

Western blotting: Cell lysates were made 48 hrs after transfection by incubating 
10 5 cells with 100 ml TENT (50 mM Tris-HCl pH 8.0, 2 mM EDTA, 150 mM NaCl, 1% 
Triton X-100, lx protease inhibitor cocktail) 10 min on ice followed by centrifugation at 

15 14,000 g. The supernatant was loaded on 4-12% NuPage gels (50 mg protein/lane) and 
electrophoresis and transfer were conducted using an Xcell II Mini-Cell system (No vex). 
The blot was blocked with 5% milk in PBS 1 hr RT and incubated with anti-myc or anti- 
FLAG antibody (described in "Immunofluorescence" above) 4°C O/N, then sheep anti- 
mouse or anti-rabbit antibody-HRP (0.1 mg/ml) 1 hr RT, foUowed by Supersignal (Pierce) 

20 detection. 



cells) were assayed in the following double antibody sandwich ELISA, which is capable of 
detecting levels of Afr^o and Afr-42 peptide in culture supernatant. 

Human AJ3 1-40 or 1-42 was measured using monoclonal antibody (mAb) 6E10 

25 (Senetek, St. Louis, MO) and biotinylated rabbit antiserum 162 or 164 (NYS Institute for 
Basic Research, Staten Island, NY) in a double antibody sandwich ELISA. The capture 
antibody 6E10 is specific to an epitope present on the N-terminal amino acid residues 1-16 
of hAp. The conjugated detecting antibodies 162 and 164 are specific for hA[3 1-40 and 1- 
42, respectively. The sandwich ELISA was performed according to the method of Pirttila et 

30 al. {Neurobiology of Aging 18: 121-7 (1997)). Briefly, a Nunc Maxisorp 96 well 

immunoplate was coated with lOOpl/well of mAb 6E10 (5^ig/ml) diluted in 0.1M carbonate- 
bicarbonate buffer, pH 9.6 and incubated at 4°C overnight. After washing the plate 3x with 
0.01M DPBS (Modified Dulbecco's Phosphate Buffered Saline (0.008M sodium phosphate, 



Immunofluorescence: 



Cells grown on slides were fixed 48 hrs after 



ELISA: 



Cell culture supernatant or cell lysates (100 ml formic acid/10 6 
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0.002M potassium phosphate, 0.14M sodium chloride, 0.01 M potassium chloride, pH 7.4) 
from Pierce, Rockford, II) containing 0.05% of Tween-20 (DPBST), the plate was blocked 
for 60 min with 200fil of 10% normal sheep serum (Sigma) in 0.01M DPBS to avoid non- 
specific binding. Human A(3 1-40 or 1-42 standards 100pJ/well (Bachem, Torrance, CA) 

5 diluted, from a lmg/ml stock solution in DMSO, in non transfected conditioned cell medium 
was added after washing the plate, as wen as lOOjil/weU of sample i.e. filtered conditioned 
medium of transfected cells. The plate was incubated for 2 hours at room temperature and 
4°C overnight. The next day, after washing the plate, lOOjil/weU biotinylated rabbit 
antiserum 162 1:400 or 164 1:50 diluted in DPBST + 0.5% BSA was added and incubated 

10 at room temperature for lhr 15 min. Following washes, lOOpJ/well neutravidin-horseradish 
peroxidase (Pierce, Rockford, II) diluted 1:10,000 in DPBST was applied and incubated for 
1 hr at room temperature. After the last washes 100|il/well of o-phenylnediamine 
dihydrochloride (Sigma Chemicals, St. Louis, MO) in 50mM citric acid/lOOmM sodium 
phosphate buffer (Sigma Chemicals, St. Louis, MO), pH 5.0, was added as substrate and the 

15 color development was monitored at 450nm in a kinetic microplate reader for 20 min. using 
Soft max Pro software. 

Results 

Transfection ofHEK293 cells: Transfection efficiency was monitored through 
the use of vectors that express green fluorescent protein (GFP) or by immunofluorescent 

20 detection of epitope-tagged sel- 10 or PS 1 . An N- terminal 6-myc epitope was used to tag 
human sel- 10 (6myc-N-sel-10), while PS1 was tagged with a C-terminal FLAG epitope 
(PS1-C-FLAG). APP695 was modified by inclusion of the Swedish NL mutation to increase 
A|3 processing and an attenuated endoplasmic reticulum (ER) retention signal consisting of a 
C-terminal di-lysine motif (APP695NL-KK). The di- lysine motif increases A[3 processing 

25 about two fold. The APP695NL-KK construct was inserted into the first cistron of a 
bicistronic vector containing GFP (pIRES-EGFP, Invitrogen) to allow us to monitor 
transfection efficiency. Transfection efficiency in HEK293 cells was about 50% for 
transfections with a single plasmid DNA. For cotransfections with two plasmids, about 30- 
40% of the cells expressed both proteins as detected by double immunofluorescence. 

30 Expression of recombinant protein in transfected HEK293 cells was confirmed by 

Western blot as illustrated for PS1-C-FLAG and 6myc-N-sel-10 (Fig 1A). In the case of 
cotransfections with three plasmids (PS1-C-FLAG + 6myc-N-sel-10 + APP), all three 
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proteins were detected in the same cell lysate by Western blot (Figure IB) using appropriate 
antibodies. 

Effect of 6myc-N-sel-I0 and PS1 -C-FLAG on A(3 processing: Cotransfection 
of APP695NL-KK with 6myc-N-sel-10 or PS1-C-FLAG into HEK293 cells increased the 

5 release of Abl-40 and Abl-42 peptide into the culture supernatant by 2- to 3-fold over 
transfections with just APP695NL-KK (Table 1). Cotransfection of APP695NL-KK with 
both 6myc-N-sel-10 and PS1-C-FLAG increased Ab release still further (i.e., 4- to 6-fold 
increase). In contrast, the ratio of Abl-42/ (Abl-40 + Abl-42) released into the supernatant 
decreased about 50%. The subtle decrease in the ratio of Abl-42/ (Abl-40 + Abl-42) 

10 reflects the larger increase in Ab 1-40 relative to Ab 1-42. Neither 6myc-N-sel-10 nor PS1- 
C-FLAG affected endogenous Ab production in HEK293 cells. Similar observations were 
also obtained in IMR32 cells (Table 2). However, IMR32 cells transfected less well than 
HEK293 cells, so the stimulation of APP695NL-KK processing by cotransfection with 
6myc-N-sel-10 or PS 1 -C-FLAG was lower. 

15 Levels of Ab 1-40 expressed in HEK293 cells transfected with APP695NL-KK were 

sufficient to measure Ab peptide in both the culture supernatant and cell pellet. 
Considerably more Ab 1-40 is detected in the HEK293 cell pellet than in the supernatant in 
cells transfected with just APP695NL-KK. Cotransfection with 6myc-N-sel-10 or PS1-C- 
FLAG proportionately decreased Ab 1-40 in the cell pellet and increased Ab in the culture 

20 supernatant. This implies that 6myc-N-sel-10 and PS 1 -C-FLAG alter processing or 
trafficking of APP such that proportionately more Ab is released from the cell. 

Effect of 6myc-N-sel-10 and PS1 -C-FLAG expression on endogenous A j3 
processing: The effect of 6myc-N-sel-10 on the processing of endogenous APP by human 
cells was assessed by creating stably transformed HEK293 cell lines expressing these 

25 proteins. Two cell lines expressing 6myc-N-sel-10 were derived (sel-10/2 & sel-10/6) as 
well as a control cell line transformed with pcDNA3.1 vector DNA. Both 6myc-N-sel-10 
cell lines expressed the protein as shown by Western blot analysis. Endogenous production 
of Ab 1-40 was increased in both 6myc-N-sel-10 cell lines in contrast to vector DNA 
transformed cells Table 3). In addition, stable expression of 6myc-N-sel-10 significantly 

30 increased Ab production after transfection with APP695NL-KK plasmid DNA (Table 3). 
Similar results were obtained with 6 stable cell lines expressing PS1-C-FLAG. All 6 cell 
lines showed significant elevation of endogenous AP processing and all also showed 
enhanced processing of Ab after transfection with APP695NL-KK (Table 3). In addition, 
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the increase of AJ5 processing seen with 6myc-N-sel-10 was also seen with sel-10 tagged at 
the C-terminus with either mychis or v5his (See Table 4). Both C-terminal and N-terminal 
tags resulted in an increase in A(3 processing. 

Discussion 

5 These data suggest that , when over expressed, 6myc-N-sel-10 as well as PS1-C- 

FLAG alter Ap 1 processing in both transient and stable expression systems. A 6-myc epitope 
tag was used in these experiments to allow detection of sel-10 protein expression by 
Western blot analysis. If as its sequence homology to yeast CDC4 suggests, sel-10 is an 
E2-E3 ubiquitin ligase, it should be possible to identify the proteins it targets for 

10 ubiquitination. Since the presenilins are degraded via the ubiquitin-proteasome pathway, 
PS1 & PS2 are logical targets of sel-10 catalyzed ubiquitination [Kim et al, J. Biol. Chem. 
272:11006-11010 (1997)]. How sel-10 affects A(3 processing is not understood at this 
point. In the future, it will be necessary to determine if sel-10 & PS1 increase A{3 processing 
by altering production, processing, transport, or turn-over of APP, and whether the effect of 

15 PS 1 is mediated or regulated by sel-10. 

These experiments suggest that sel-10 is a potential drug target for decreasing Ab 
levels in the treatment of AD. They also show that C. elegans is an excellent model system 
in which to investigate presenilin biology in the context of AD. Thus, as is shown by 
cotransfection experiments, as well as in stable transformants, expression of 6myc-N-sell0 

20 or PS1 -C-FLAG increases Ap" processing. An increase in A|3 processing was seen in both 
HEK293 cells and IMR32 cells after cotransfection of 6myc-N-sellO or PS 1 -C-FLAG with 
APP695NL-KK. In stable transformants of HEK293 cells expressing 6myc-SellO or PS1- 
C-FLAG, an increase in endogenous A(3 processing was observed, as well as an increase in 
Ap processing after transfection with APP695NL-KK. This suggests that inhibitors of either 

25 sel-10 and/or PS1, may decrease A(3 processing, and could have therappeutic potential for 
Alzheimer's disease. 

Example 4: SEL-10 Interacts with Presenilin 1, Facilitates Its Ubiquitination, and 
Alters AP Production 

30 

Mutations in the presenilin genes (PS1 or PS2) in man cause autosomal dominant 
early onset Alzheimer's disease. These have been linked to alterations in the processing of 
the amyloid protein precursor (APP) 1 ' 2 . The presenilins are membrane proteins with 6-8 
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transmembrane domains 3 ' 4 , which localize to the endoplasmic reticulum, Golgi complex, 
nuclear envelope, kinetochore and centrosome 5,6 . Mutations in sel-12, a Caenorhabditis 
elegans presenilin cause a defect in egg laying 7 . Sel-12 is one of three nematode presenilin 
homologous (sel-12, hop-1 and spe-4) 1S . The sel-12 mutant phenotype can be rescued by 
5 human PS 1 or PS 2 10 , indicating that PS1 , PS2 and sel-12 are functional homologues. 

Mutations in sel-12 cause a defect in egg laying by altering signaling through the Notch///«- 
12 pathway. The sel-12 mutant phenotype can be suppressed by loss of function mutations 
in a second gene, sel-10 u , which probably results in rescue of the egg laying defect by 
increasing the activity of a functionally redundant presenilin, hop-1 8 ' 9 . SEL-10 is a 

10 homologue of yeast Cdc4, a member of the SCF (Skpl-Cdc53/CULl-F-box protein) E2-E3 
ubiquitin ligase family 12 . In this study, we show that human SEL-10 interacts with PS1 and 
enhances PS1 ubiquitination, thus altering cellular levels of unprocessed PS1 and its N- and 
C-terminal fragments. This leads to an alteration in the metabolism of APP and to an 
increase in the production of amyloid pVpeptide, the principal component of amyloid plaque 

15 in Alzheimer's disease I3 . 

The SCF E2-E3 ubiquitin ligases contain a catalytic core consisting of Skpl, Rbxl, 
Cdc53/CUL-1 and an E2 ubiquitin transferase, Cdc34 14 . These are targeted to substrates 
for ubiquitination by adapter proteins (e.g., Cdc4, Grrl, Met30, (3-TrCP) containing an F- 
box motif and WD40 repeats 15 . There is evidence that presenilins are ubiquitinated and 

20 undergo degradation through the ubiquitin-proteasome pathway 16, 11 . Physical interaction 
between C. elegans SEL-10 and SEL-12 has been shown previously u . Thus, SEL-10 may 
be the F-box adaptor protein that recruits presenilins for ubiquitination and subsequent 
degradation. 

Both human and mouse orthologs of C. elegans sel-10 have been identified in EST 
25 databases, although the sequence information is incomplete 12 . We used rapid amplification 
of cDNA ends (RACE) to clone amplification products containing the full coding sequences 
for human and mouse sel-10. There are two variants of human sel-10 cDNA from 
hippocampus (hippocampus form) and mammary gland (common form) which differ in 5' 
sequence upstream of a common translation initiation site. This may indicate that the human 
30 sel-10 transcript undergoes differential splicing in a tissue-specific fashion or that the gene 
contains multiple, tissue specific promoters. The hippocampal form contains four in- frame 
methionines upstream of the common initiation site and the mammary form contains three. 
Whether or not these encode proteins with different N-termini is not known. The common 
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transcript is ubiquitously expressed in all tissues tested (Fig 2A), while the hippocampal form 
is present only in brain (Fig 2B). The human sel-10 gene was localized to chromosome 
4q31.2-31.3 by in situ hybridization (data not shown). The predicted protein sequences of 
human and C. elegans SEL-10 have 47.6% amino acid identity and 56.7% similarity. Human 

5 SEL- 10 contains an F-box domain as found in other SCF family members. It also contains 
seven WD40/pVtransducin repeats 18 as seen in yeast Cdc4p and C. elegans SEL-10, 
suggesting that the protein forms a seven-bladed propeller structure 19 . 

We first assessed the physical interaction of human SEL-10 and PS1. SEL-10 
tagged with an N-terminal 6-Myc epitope and PS1 were transiently co-expressed in human 

10 embryonic kidney cells (HEK293) and their interaction was assessed by immuno- 

precipitation. Complexes between SEL-10 and PS1 could only be detected in the presence 
of a proteasome inhibitor, lactacystin, which was added to the cultures at the time of 
transfection. When immunoprecipitated with anti-PS 1 loop antibody, only the immuno- 
preciptate from cotransfected cells contained SEL-10-myc (Fig 3A), indicating that SEL-10 

15 can interact with PS 1 . Since co-immunoprecipitation was not observed in cells without 
lactacystin treatment (Fig 3B), this suggests that the complex can only be captured by 
blocking the entry of PS 1 into the proteasome degradation pathway. The interaction 
between SEL-10 and PS1 was confirmed in the reverse experiment by using anti-myc 
antibody for immunoprecipitation of SEL-10 (Fig 3C). PS1 is cleaved within the 

20 cytoplasmic loop between transmembrane domains six and seven by an unknown protease 
which generates N- and C-terminal fragments (PS1-NTF and PS1-CTF, respectively). The 
SEL-10-myc immunoprecipitates contained primarily full length and high molecular weight 
forms of PS1, but only very low or undetectable amounts of PS1-NTF and no PS1-CTF. 
This suggests that SEL-10 may bind primarily to unprocessed PS1. 

25 Next, we examined the effect of SEL-10 co -transfection on PS1 ubiquitination. PS1 

immunoprecipitated from cotransfected cells contains a higher level of ubiquitination 
compared to cells transfected with PS1 alone as shown by probing with anti-ubiquitin 
antibody (Fig 4A). This result demonstrates that complex formation between SEL-10 and 
PS1 facilitates ubiquitination as implied previously by the need for lactacystin to demonstrate 

30 SEL- 10/PS1 complex accumulation. 

We then investigated how ubiquitination affects PS1 protein level. HEK293 cells 
were cotransfected with either SEL-10 or PS1 alone or in combination and 
immunoprecipitates were probed with anti-PS 1 loop antibody (Fig. 4B). In cells co- 
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transfected with PS1 and SEL-10 in comparison to cells transfected with PS1 alone, PS1- 
NTF and PS1-CTF were decreased as were the high molecular weight forms of PS1. 
However, the amount of unprocessed PS1 appeared to be slightly increased. There are 
several possible interpretations of this complex cellular effect. The first is that SEL-10 
5 mediated ubiquitination of PS 1 -CTF and PS 1 -NTF leads to their degradation. However, the 
fragments of PS1 are not immunoprecipitated with SEL-10 suggesting that SEL-10 does not 
directly facilitate ubiquitination of PS 1 -NTF and PS 1 -CTF. The slight increase in 
unprocessed PS1 seen in cells cotransfected with SEL-10 may suggest instead that SEL-10 
binding to PS1 or SEL-10 mediated ubiquitination of PS1 inhibits proteolytic processing of 

10 PS 1 -NTF and PS 1-CTF leading to accumulation of the unprocessed precursor. 

In order to examine the impact of SEL-10 mediated ubiquitination of PS1 on 
amyloid ^-peptide (Afi) production, SEL-10 was expressed by either transient or stable 
transfection in HEK293 cells with or without wild-type PS1. AP peptide production was 
measured by enzyme immunoassays that could distinguish the 1-40 and 1-42 forms of the 

15 peptide. In transient expression experiments (Fig. 5 A), coexpression of SEL-10 with APP 
increased production and release of Api-40 and AfJl-42 by more than 2-fold in comparison 
to APP expression alone. Transient coexpression of PS1 with APP also increased Api-40 
and Api-42 levels by more than 3-fold, similar to previous reports 20 although this is not a 
consistent finding 2 . Coexpression of SEL-10 and PS1 with APP had an additive effect with 

20 an increase in AP production of approximately 7-fold. No effect on the ratio of Afil- 
42/total Ap was observed with all values falling in the range 8.5% to 13.5%. The 
stimulation of A(5 processing due to expression of SEL-10 was consistently observed across 
experiments, however, the degree of stimulation did vary. To confirm and extend the result, 
a series of HEK293 cell lines were derived with stable expression of SEL-10 or PS1 (Fig. 

25 5B). The two SEL-10 cell lines obtained showed 4- and 7-fold increases in endogenous 
Api-40 peptide production compared to HEK293 cells transfected with just the pcDNA3.1 
vector. Similarly, six cell lines with stable expression of PS1 showed a 2- to 6-fold increase 
in endogenous Api-40 peptide production. Transient transfection of the stable cell lines 
with APP cDNA was performed to confirm and extend this result (Fig. 5C). Production of 

30 Ap was greater in the SEL-10 and PS1 cell lines in comparison to a control HEK293 cell 
line that had been transformed with just the pcDNA3.1 vector. Since SEL-10 decreases the 
cellular level of PS1-NTF and PS1-CTF fragments while slightly increasing unprocessed 
PS1, the increase in unprocessed PS1 may be associated with the increase in AP peptide 
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processing observed. Transient expression of PS1 has a similar effect. It increases the 
cellular level of unprocessed PS 1 while having relatively little effect on the levels of PS 1 - 
NTF and PS1-CTF fragments. 

Our data indicate that SEL-10 interacts with PS1, stimulates PS1 ubiquitination, and 

5 recruits it into the proteasome pathway for protein degradation. SEL- 10 is likely to function 
as an adaptor protein that assembles the core catalytic complex of an SCF E2-E3 ubiquitin 
ligase 14 . Recognition of most SCF substrates by F-box/WD40 repeat adaptor proteins is 
phosphorylation dependent l3 , suggesting that this may be an additional level of cellular 
regulation of presenilin levels. To demonstrate complex formation between human SEL-10 

10 and PS1 in HEK293 cells required inhibition of proteasome function by lactacystin. In 
contrast, the C. elegans SEL-10/SEL-12 complex accumulates in human HEK293 cells in 
the absence of proteasome inhibitors suggesting that nematode SEL-10 is unable to 
assemble the human core catalytic complex 11 . Degradation of an eight-pass transmembrane 
protein such as presenilin presents a topological hurdle since the presenilin protein must be 

15 extracted from the membrane and delivered to the proteasome. Like presenilins, a number 
of other multipass integral membrane proteins with large cytoplasmic domains such as the 
cystic fibrosis transmembrane conductance regulator (CFTR) are degraded through the 
ubiquitin-proteasome pathway 21 ' 22 . This pathway also is important for quality control 
within the endoplasmic reticulum 23 and conceivably could impact on intracellular production 

20 of A (3 peptide. 

Transfection of PS1 into human cells generally is observed to increase cellular levels 
of unprocessed PS1, but causes little change in levels of PS1-CTF and PS 1 -NTF suggesting 
that processing is under tight cellular control 11 , perhaps due to their involvement in 
apoptotic pathways 24 . As a consequence of increased transient or stable expression of PS1, 

25 we see an increase in AfJ peptide production that probably is due to the accumulation of 
unprocessed PS1. A similar effect on A(3 peptide production and accumulation of 
unprocessed PS1 is caused by increased expression of SEL-10. SEL-10 may directly 
regulate PS1 processing by binding to the cleavage site in the large cytoplasmic loop of PS1, 
or SEL-10 mediated ubiquitination may block PS1 processing. Similarly, a number of PS1 

30 mutations associated with familial Alzheimer's disease have been shown to decrease 

processing of PS1 into N- and C-terminal fragments, although that is not a consistent finding 
25 . However, mutant forms of PS1 are associated with a shift in the ratio of Api-42 and 
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Apl-40, whereas expression of wild-type PS1 or SEL-10 has no effect on the ratio of A(51- 
42/total Ap u 

The genetic data indicates that SEL-10 is a negative regulator of presenilin activity in 
C. elegans. Loss of SEL-10 function presumably rescues the egg laying defect in sel-12 

5 mutant worms through facilitation of HOP-1 presenilin activity, perhaps by allowing the 
increased accumulation of processed N- and C-terminal fragments of HOP-1. Sel-10 was 
identified in a screen for mutations that increase presenilin activity 11 . In principle, genetic 
screens in model organisms such as C. elegans or Drosophila can be used to find mutations 
that decrease presenilin activity, the desired therapeutic goal in Alzheimer's disease 26 . Such 

10 screens have the potential to identify novel therapeutic targets for this devastating disease. 

Methods 

Cloning. Incyte clone (028971) was identified as the human homologue of C. elegans sel- 
ls 10 and its sequence was used to design four antisense oligonucleotide primers 5'-TCACT- 
TC ATGTCC ACATC AAAGTCC-3 ' (SEQ ID NO: _), 5 ' -GGTAATTAC AAAGTTCTTG- 
TTGAACTG-3' (SEQ ID NO: _), 5 ' -CCCTGC AACGTGTGTAGAC AGG-3 ' (SEQ ID 
NO: _), and 5 ' -CC AGTCTCTGC ATTCC AC ACTTTG-3 ' (SEQ ID NO: _), to amplify 
the remainder of the human sel-10 sequence. "Electronic Northern" analysis revealed 
20 expression of sel-10 in hippocampus and mammary gland so these tissues were chosen for 
5'RACE cloning using Marathon kit (CloneTech). Marathon- ready cDNA from 
hippocampus and mammary gland were prepared as directed in the kit. PCR products were 
cloned into the TA vector pCR3.1 (Invitrogen), and isolates were sequenced. An alternate 
5' oligonucleotide primer was also designed based on Incyte clones that have 5' ends that 
25 differ from the hippocampal sel- 1 0 sequence (5 ' -CTC AG AC AGGTCAGGAC ATTTGG-3 ' 

(SEQ ID NO: ). Blastn was used to search the Incyte databases LifeSeq and LifeSeqFL. 

Gap alignments and translations were performed with GCG programs. 

Plasmids and transfections. The human sel-10 cDNA was inserted into the EcoRl site of 
30 the vector pCS2+MT (gift of Jan Kitajewski, Columbia University College of Physicians and 
Surgeons). This fused a 5' 6-myc epitope tag in- frame to the fifth methionine of the 
hippocampal sel-10 cDNA. The hippocampal and mammary sel-10 cDNA diverge upstream 
of this methionine. A PS1 cDNA with a 3 '-FLAG tag (PS1-C-FLAG) was inserted into the 
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pcDNA3.1 vector. An APP cDNA containing the Swedish KM -^NL mutation and an 
attenuated ER retention sequence consisting of a C-terminal di-lysine motif (APP695Sw- 
KK) was inserted into the pIRES-EGFP vector (Clontech). HEK293 cells were grown to 
80% confluence in DMEM with 10% FBS and transfected with the above cDNAs. A total 

5 of 10 (J.g DNA/6 x 10 6 cells was used for transfection with a single plasmid. For 

cotransfections of multiple plasmids, an equal amount of each plasmid was used for a total 
of 10 jig DNA using Lipofect Amine (BRL). Single cells were sorted into each well of one 
96 well plate containing growth medium without G418 by FACS using an EPICS Elite ESP 
flow cytometer (Coulter, Hialeah, FL) equipped with a 488 nm excitation line supplied by an 

10 air-cooled argon laser. After a four day recovery period, G418 was added to the medium to 
a final concentration of 400 ng/ml for the selection of stably transfected cell lines. Stable 
expression of a cDNA was confirmed by detection of the specific protein or sequence tag by 
Western blot. 



15 Immunoprecipitation, Western blot, and ELISA. AfJ and PS1 antibodies have been 

characterized 6 ' 21 . Cell lysates with equal amount of protein were precipitated with antibody 
and protein G-Sepharose at 4 °C for 2 hr and beads were washed three times with TENT 
buffer 11 . Immunoprecipitates were analyzed by Western blot using 4-12% NuPage Bis-Tris 
gel and PVDF membrane (Novex), peroxidase-conjugated secondary antibody (Vector) and 

20 SuperSignal West Pico Luminol/Enhancer ( Pierce). The ELISA for A(3 1-40 and Ap 1-42 
was performed as described 21 . 
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It will be clear that the invention may be practiced otherwise than as particularly 
described in the foregoing description and examples. 

Numerous modifications and variations of the present invention are possible in 

light of the above teachings and, therefore, are within the scope of the invention. 

The entire disclosure of all publications cited herein are hereby incorporated by 
reference. 
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What is claimed is: 

1 . An isolated nucleic acid molecule comprising a polynucleotide having a 
sequence at least 95% identical to a sequence selected from the group consisting of: 

(a) a nucleotide sequence encoding a human sel-10 polypeptide 
having the complete amino acid sequence selected from the group consisting of SEQ 
ID NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, and SEQ ID NO:7, or as 
encoded by the cDNA clone contained in ATCC Deposit No. 98978; 



having the complete amino acid sequence selected from the group consisting of SEQ 
ID NO:8, SEQ ID NO:9, and SEQ ID NO: 10, or as encoded by the cDNA clone 
contained in ATCC Deposit No. 98979; and 

(c) a nucleotide sequence complementary to the nucleotide 
sequence of (a) or (b). 

2. An isolated nucleic acid molecule comprising polynucleotide which 
hybridizes under stringent conditions to a polynucleotide having the nucleotide 
sequence in (a), (b), or (c) of claim 1. 

3. The nucleic acid molecule of claim 1, wherein said polynucleotide of 
1(a) encodes a human sel-10 polypeptide having the complete amino acid sequence of 
SEQ ID NO:3. 

4. The nucleic acid molecule of claim 3, wherein said polynucleotide 
molecule of 1(a) comprises the nucleotide sequence of residues 45-1928 of SEQ ID 
NO:l. 

5. The nucleic acid molecule of claim 1, wherein said polynucleotide of 
1(a) encodes a human sel-10 polypeptide having the complete amino acid sequence of 
SEQ ID NO:4. 



(b) 



a nucleotide sequence encoding a human sel-10 polypeptide 
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6. The nucleic acid molecule of claim 5, wherein said polynucleotide 
molecule of 1(a) comprises the nucleotide sequence of residues 150-1928 of SEQ ID 
NO:l. 

7. The nucleic acid molecule of claim 1, wherein said polynucleotide of 
1(a) encodes a human sel-10 polypeptide having the complete amino acid sequence of 
SEQ ID NO:5. 

8. The nucleic acid molecule of claim 7, wherein said polynucleotide 
molecule of 1(a) comprises the nucleotide sequence of residues 267-1928 of SEQ ID 
NO:l. 

9. The nucleic acid molecule of claim 1, wherein said polynucleotide of 
1(a) encodes a human sel-10 polypeptide having the complete amino acid sequence of 
SEQ ID NO:6. 

10 The nucleic acid molecule of claim 9, wherein said polynucleotide 
molecule of 1(a) comprises the nucleotide sequence of residues 291-1928 of SEQ ID 
NO:l. 

11. The nucleic acid molecule of claim 1, wherein said polynucleotide of 
1(a) encodes a human sel-10 polypeptide having the complete amino acid sequence of 
SEQ ID NO:7. 

12. The nucleic acid molecule of claim 1 1 , wherein said polynucleotide 
molecule of 1(a) comprises the nucleotide sequence of residues 306-1928 of SEQ ID 
NO:l. 

13. The nucleic acid molecule of claim 1, wherein said polynucleotide of 
1(b) encodes a human sel-10 polypeptide having the complete amino acid sequence of 
SEQ ID NO:8. 
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14. The nucleic acid molecule of claim 13 wherein said polynucleotide 
molecule of 1(b) comprises the nucleotide sequence of residues 180-1949 of SEQ ID 
NO:2. 

5 15. The nucleic acid molecule of claim 1 , wherein said polynucleotide of 

1(b) encodes a human sel-10 polypeptide having the complete amino acid sequence of 
SEQ ID NO:9. 

16. The nucleic acid molecule of claim 15, wherein said polynucleotide 

10 molecule of 1(b) comprises the nucleotide sequence of residues 270-1949 of SEQ ID 
NO:2. 

17. The nucleic acid molecule of claim 1, wherein said polynucleotide of 
1(b) encodes a human sel-10 polypeptide having the complete amino acid sequence of 

15 SEQ ID NO: 10. 

18. The nucleic acid molecule of claim 17, wherein said polynucleotide 
molecule of 1(b) comprises the nucleotide sequence of residues 327-1949 of SEQ ID 
NO:2. 

20 

19. A vector comprising the nucleic acid molecule of claim 1. 

20. The vector of claim 19, wherein said nucleic acid molecule of claim 1 is 
operably linked to a promoter for the expression of a sel-10 polypeptide. 

25 

21 . A host ceil comprising the vector of claim 19. 

22. The host cell of claim 21, wherein said host is a eukaryotic host. 

30 23. A method of obtaining a sel- 10 polypeptide comprising culturing the 

host cell of claim 22 and isolating said sel-10 polypeptide. 

24. An isolated sel-10 polypeptide comprising 
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(a) an amino acid sequence selected from the group consisting of 
SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, and SEQ ID NO:7, or 
as encoded by the cDNA clone contained in ATCC Deposit No. 98978; 

(b) an amino acid sequence selected from the group consisting of 
SEQ ID NO:8, SEQ ID NO:9, and SEQ ID NO: 10, or as encoded by the cDNA clone 
contained in ATCC Deposit No. 98979. 

25. The isolated sel-10 polypeptide of claim 24, wherein said polypeptide 
comprises the amino acid sequence of SEQ ID NO:3. 

26. The isolated sel-10 polypeptide of claim 24, wherein said polypeptide 
comprises the amino acid sequence of SEQ ID NO:4. 

27. The isolated sel-10 polypeptide of claim 24, wherein said polypeptide 
comprises the amino acid sequence of SEQ ID NO:5. 

28. The isolated sel-10 polypeptide of claim 24, wherein said polypeptide 
comprises the amino acid sequence of SEQ ID NO:6. 

29. The isolated sel-10 polypeptide of claim 24, wherein said polypeptide 
comprises the amino acid sequence of SEQ ID NO:7. 

30. The isolated sel-10 polypeptide of claim 24, wherein said polypeptide 
comprises the amino acid sequence of SEQ ID NO: 8. 

31. The isolated sel-10 polypeptide of claim 24, wherein said polypeptide 
comprises the amino acid sequence of SEQ ID NO:9. 

32. The isolated sel-10 polypeptide of claim 24, wherein said polypeptide 
comprises the amino acid sequence of SEQ ID NO: 10. 

33. An isolated antibody that binds specifically to the sel-10 polypeptide of 
claim 24. 
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34. A cell line having altered AJ3 processing that expresses any of the sel- 
10 isolated nucleic acid molecules of claim 1. 

35. The cell line of claim 34, wherein said Ap processing is increased. 

36. The cell line of claim 34, wherein said AP processing is decreased. 

37. The cell line of claim 34, wherein said cell line is 6myc-N-sellO/2. 

38. The cell line of claim 34, wherein said cell line is 6myc-N-sell0/6. 

39. A method for the identification of an agent capable of altering the ratio 
of Ap M0 / Aj3 M0 + Ap\_4 2 produced in any of the cell lines of claims 34, 37, and 38, 
comprising the steps of: 

(a) obtaining a test culture and a control culture of said cell line; 

(b) contacting said test culture with a test agent; 

(c) measuring the levels of Ap\_4 0 and Ap^ 2 produced by said test 
culture of step (b) and said control culture; 

(d) calculating the ratio of APiW Af^o + AfW for said test 
culture and said control culture from the levels of Af3i^ 0 and AfW measured in step 
(c); and 

(e) comparing the ratio of A$ 1A0 / Ap^o + Ap^ 2 measured for said 
test culture and said control culture in step (d); 

whereby a determination that the ratio of AP^o/ Ap\_4 0 + Ap^for said test culture is 
higher or lower than ratio of Afi^o/ Ap M0 + Ap^ 2 for said control culture indicates 
that said test agent has altered the ratio of Ap^o/ APi_4 0 + Api_4 2 . 

40. The method of claim 39, wherein said ratio of ApiV AP^ 0 + Afl^ is 
increased by said test agent. 

41 . The method of claim 39, wherein said ratio of Aft V APi_ 40 + AP^ 2 is 
decreased by said test agent. 
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Human Sel-10 Polypeptides and Polynucleotides that Encode Them 

The present invention provides isolated nucleic acid molecules comprising a 
polynucleotide encoding either of two alternative splice variants of human sel-10, one 
5 of which is expressed in hippocampal cells, and one of which is expressed in mammary 
cells. The invention also provides isolated sel-10 polypeptides and cell lines which 
express them in which A{3 processing is altered. 
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FIGURES 1 A AND IB. Western blot of protein expression in HEK293 cells 
transfected with PS1-C-FLAG, 6-mye-N-sel-10, and APP295NL-KK cDNAs. 
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SEQUENCE LISTING 



<110> Gurney, Mark E. 
Li , J inhe 
Pauley, Adele M. 
Pharmacia & Upjohn Company 

<120> Human Sel-10 Polypeptides and Polynucleotides that 
Encode Them 

<130> 6142 

<140> 6142 
<141> 1997-12-19 

<160> 27 

<170> Patentln Ver. 2.0 

<210> 1 

<211> 3550 

<212> DNA 

<213> Homo sapiens 



ctcattattc 


cctcgagttc 


ttctcagtca 


agctgcatgt 


atgtatgtgt 


gtcccgagaa 


60 


gcggtttgat 


actgagctgc 


atttgccttt 


actgtggagt 


tttgttgccg 


gttctgctcc 


120 


ctaatcttcc 


ttttctgacg 


tgcctgagca 


tgtccacatt 


agaatctgtg 


acatacctac 


180 


ctgaaaaagg 


tttatattgt 


cagagactgc 


caagcagccg 


gacacacggg 


ggcacagaat 


240 


cactgaaggg 


gaaaaataca 


gaaaatatgg 


gtttctacgg 


cacattaaaa 


atgatttttt 


300 


acaaaatgaa 


aagaaagttg 


gaccatggtt 


ctgaggtccg 


ctctttttct 


ttgggaaaga 


360 


aaccatgcaa 


agtctcagaa 


tatacaagta 


ccactgggct 


tgtaccatgt 


tcagcaacac 


420 


caacaacttt 


tggggacctc 


agagcagcca 


atggccaagg 


gcaacaacga 


cgccgaatta 


480 


catctgtcca 


gccacctaca 


ggcctccagg 


aatggctaaa 


aatgtttcag 


agctggagtg 


540 


gaccagagaa 


attgcttgct 


ttagatgaac 


tcattgatag 


ttgtgaacca 


acacaagtaa 


600 


aacatatgat 


gcaagtgata 


gaaccccagt 


ttcaacgaga 


cttcatttca 


ttgctcccta 


660 



<400> 



1 



1 





• 




nrri gc 


actctatgtg 




agacatgtcg 


ctactggaga 


attttggctg 


aagaagaggg 


ga t tga.tgaa 


ccattgcaca 


tcatacacag 


tccatggaaa 


agtgcataca 


ggC Q3.gQ3.Q3. 


acteddate t 


cc taaggtgc 


CfC t ta.Cagt t 


t tgtggta.ac 


cgaatagtta 


CfCJ" t C 3.Q C 3 Cf t 


cacaggca.aa 


tgtctgagaa 


ca.tca.caaa.t 


ga.gaga.caac 


atcatcatta 


ggaatgcaga. 


gac t ggaga.a 


fcgtatacaca 


gtatgcatct 


t ca.tgaaaa.a 


agagttgtta 


gggatattga 


ga.ca.ggccag 


tgtttacatg 


gtgttcaata 


tga tggcagg 


agggttgtta 


ggg<3- t c c aga. 


ga.c tgaa.acc 


tgtctacaca 


ca.ttaca.gtt 


tga tgg ta.t c 


cat g tgg tga 


ggg<a t g t gga. 


ga.c aggga.a t 


tgcattcaca 


gaatggaact 


ca.aa.gaca.at 


attcttgtct 


ggga.t ate aa. 


aa.c aggac ag 


tgt t tacaaa 


c tg tgacc tg 


t ttacagttc 


aac aagaac t 


ta.aa.a.c ta.t g 


ggac 1 1 ga.aa. 


aegggt gaat 


gggggagtgg 


gggagttgtg 


tggeggatea 




lHaglllll 


gaag aaac c a 








tt ccccc tgc 


aaaaagaaaa 


aaagaaaaga 


gcaggatg t t 


ggcttggggc 


aacagattga 


aagagatgac 


aaaccataac 


tgacaagaga 


tcacttttga 


c t gagggc ag 


etttgeaaaa 


tatttct t ta. 


ttttcttctc 


cag tgg teat 


tUtalclll 


tagcctgttc 






acaagtac tg 


gttgac tttc 


1 1 tt t c tgga 


gtggaaaagc 


ttaaaaaaat 


gaaaagcttt 


tttttttatt 


ttttngccaa 


agcctctgtt 


aatctattta 


ctgttgcttc 


aaaggtggca 


agttgtcc tg 


ggttctgtga 


gtgctagaag 


taggtcttca 


aatatgggat 




tggaacccaa 


agacctgeta 


caagcagctc 


720 


aagacaacct 


tctctggaga 


gagaaatgea 


780 


tcaagagaag 


aaaagtaata 


aaaccaggtt 


840 


tcagacagca 


cagaattgat 


actaactgga 


900 


tgaaaggaca 


tgatgatcat 


gtgatcacat 


950 


gtggttctga 


tgacaacact 


ttaaaagttt 


1020 


cattagtggg 


acatacaggt 


ggagtatggt 


1080 


gtggatctac 


agateggaca 


ctcaaagtgt 


1140 


ccttatatgg 


gcatacttcc 


actgtgcgtt 


1200 


gcggttctcg 


agatgecact 


cttagggttt 


1260 


ttttgatggg 


tcatgttgca 


gcagtccgct 


1320 


gtggagcata 


tgattttatg 


gtaaaggtgt 


1380 


cgttgcaggg 


gcatactaat 


agagtctatt 


1440 


gtggatctct 


tgatacatca 


atccgtgttt 


1500 


cgttaacagg 


gcaccagtcg 


ttaacaagtg 


1560 


ctgggaatgc 


agattctaca 


gttaaaatct 


1620 


cattgeaagg 


tcccaacaag 


catcagagtg 


1680 


ttgtaattac 


cagctcagat 


gatggaactg 


1740 


ttattcgaaa 


cctagtcaca 


ttggagagtg 


1800 


gagectcaaa 


cacaaagctg 


gtgtgtgcag 


1860 


agctgctggt 


gctggacttt 


gatgtggaca 


1920 


ccaattgtgt 


agacgatata 


ctccctgccc 


1980 


aaaagaaaaa 


aatcccttgt 


tctcagtggt 


2040 


aaagacctac 


agactaagaa 


ggaaaagaag 


2100 


ggegtctget 


gtctcatcac 


ataaaaggct 


2160 


tgagactttc 


taaatcaaac 


caggtgcaat 


2220 


tggggcagtg 


ttaatgctga 


aacatcatta 


2280 


gacagctaga 


cacctagaaa 


ggaactgcaa 


2340 


taattagaga 


geatctgeaa 


caaaaagtca 


2400 


tactgtgaat 


tgtttttgta 


cagttatcat 


2460 


ccattgccaa 


tgtcaatcaa 


tcacagtatt 


2520 


catatacatt 


ettcaatgea 


tatgttgctc 


2580 


gtcctgagat 


ggatttaatt 


ettgatgetg 


2640 


tgttgtccca 


accctgtact 


gtactcccag 


2700 



2 



tggccaaact tatttatgct gctaaatgaa 
tttttttctg ctgtgacgtt ttagtcccag 
atggaaaaaa gactttttgc cactgaaact 
atggaagagt ttcagataat aaagagtgaa 
gtgcaaagct tattttcttt tatctgggca 
tattacttgc ctgttctgtg gagaaacttt 
cgtacatcgt aaaactgaca aaatatttta 
gttgctggtc agtcttagtg ttttacagta 
ctctgagtaa ctgacaaagc agaaactatt 
acaaacaaaa tgaatgaaac agtcaaatgg 
agctgaactg tgaaagtggt ttaacactgt 
ttattttttt gnttgtttta tttatagtct 
attttagtta tgtaacaacc tgacatgatg 
tatggtttga ttcacttaga aattttattt 
ttttcatgtt 



agaaagaaaa aagcaaatta ttttttttat 2760 
actgaattcc aaatttgctc tagtttggtt 282 0 
tgagccatct gtgcctctaa gaggctgaga 2880 
gtttgcctgc aagtaaagaa ttgagagtgt 2940 
aaaattaaaa cacattcctt ggaacagagc 3 00 0 
tctttttgag ggctgtggtg aatggatgaa 3 060 
aaaatatata aaacacaaaa ttaaaataaa 312 0 
tttgggaaaa caactgttac agttttattg 3180 
cagtttttgt agtaaaggcg tcacatgcaa 3240 
tttgcctcat tctccaagag ccacaactca 33 00 
atcctaggcg atcttttttc ctccttctgt 33 60 
gatttaaaac aatcagattc aagttggtta 342 0 
gaggaaaaca acctttaaag ggattgtgtc 3480 
tcttataact taagtgcaat aaaatgtgtt 3 540 
3550 



<210> 2 

<211> 3571 

<212> DNA 

<213> Homo sapiens 



<400> 2 

ctcagcaggt caggacattt ggtaggggaa 
ttctcagcct tttaaaaact attattaaat 
tttagtaatg tgcctgtatt acatgtagag 
tgtcaaaacc gggaaaacct actctaaacc 
caaaagagcc tctaccacat caaaccgtga 
aaggcctccc tttttgtcga agacggatga 
gctctttttc tttgggaaag aaaccatgca 
ttgtaccatg ttcagcaaca ccaacaactt 
ggcaacaacg acgccgaatt acatctgtcc 
aaatgtttca gagctggagt ggaccagaga 
gttgtgaacc aacacaagta aaacatatga 
acttcatttc attgctccct aaagagttgg 
aagacctgct acaagcagct cagacatgtc 



ggttgaaaga caaaagcagc aggccttggg 6 0 
atatattttt aaaatttagt ggttagagct 12 0 
agtattcgtc aaccaagagg agttttaaaa 18 0 
atggcttggt tcctgttgat cttaaaagtg 24 0 
tgaagatatt tagcattagc atcattgccc 3 00 
aaagaaagtt ggaccatggt tctgaggtcc 3 60 
aagtctcaga atatacaagt accactgggc 42 0 
ttggggacct cagagcagcc aatggccaag 48 0 
agccacctac aggcctccag gaatggctaa 54 0 
aattgcttgc tttagatgaa ctcattgata 600 
tgcaagtgat agaaccccag tttcaacgag 660 
cactctatgt gctttcattc ctggaaccca 720 
gctactggag aattttggct gaagacaacc 780 





• 




1 1 c t c t ggag 


agagaaatgc 


aaagaagagg 


gaaaagtaat 


aaaaccaggt 


ttcatacaca 


acagaattga 


tac taac tgg 


ag gc gaggag 


at gat gat ca 


tgtgatcaca 


tgcttacagt 


atgacaacac 


tttaaaagtt 


tggtcagcag 


gacatacagg 


tggagtatgg 


tcatcacaaa 


cagatcggac 


actcaaagtg 


tggaatgcag 


ggcatac t tc 


cactgtgcgt 


tgtatgcatc 


gagatgccac 


tcttagggtt 


tgggatat tg 


gtcatgttgc 


agcagtccgc 


tgtgttcaat 


atgattttat 


ggtaaaggtg 


tgggatccag 


ggcatac taa 


tagagtctat 


tcattacagt 


ttgatacatc 


aatccgtgtt 


tgggatgtgg 


ggc ac c ag t c 


gttaacaagt 


ggaatggaac 


cagattctac 


agttaaaatc 


tgggatatca 


gtcccaacaa 


gcatcagagt 


gctgtgacct 


ccagctcaga 


tgatggaact 


gtaaaactat 


acctagtcac 


at tggagagt 


ggygggagtg 


acacaaagct 


ggtgtgtgca 


gttgggagtc 


tgctggactt 


tgatgtggac 


atgaagtgaa 


tagacgatat 


actccctgcc 


cttccccctg 


aaatcccttg 


ttctcagtgg 


tgcaggatgt 


cagac taaga 


aggaaaagaa 


gaagaga t ga 


tgtctcatca 


cataaaaggc 


ttcacttttg 


ctaaatcaaa 


ccaggtgcaa 


ttatttcttt 


gttaatgctg 


aaacatcatt 


acagattctg 


acacctagaa 


aggaac tgca 


ataatatcaa 


agcatc tgca 


acaaaaagtc 


atttttctgg 


ttgtttttgt 


acagttatca 


tgaaaagctt 


atgtcaatca 


atcacagtat 


tagcctc tgt 


tcttcaatgc 


atatgttgct 


caaaggtggc 


tggatttaat 


tcttgatgct 


ggtgctagaa 


aaccctgtac 


tgtactccca 


gtggccaaac 


aaagcaaatt 


atttttttta 


ttttttttct 




ggattgatga 


accattgcac 


atcaagagaa 


840 


gtccatggaa 


aagtgcatac 


atcagacagc 


900 


aactcaaatc 


tcctaaggtg 


ctgaaaggac 


960 


tttgtggtaa 


ccgaatagtt 


agtggttctg 


1020 


tcacaggcaa 


atgtctgaga 


acattagtgg 


1080 


tgagagacaa 


catcatcatt 


agtggatcta 


1140 


agactggaga 


atgtatacac 


accttatatg 


1200 


ttcatgaaaa 


aagagttgtt 


agcggttctc 


1260 


agacaggcca 


gtgtttacat 


gttttgatgg 


1320 


atgatggcag 


gagggttgtt 


agtggagcat 


1380 


agactgaaac 


ctgtctacac 


acgttgcagg 


1440 


ttgatggtat 


ccatgtggtg 


agtggatctc 


1500 


agacagggaa 


ttgcattcac 


acgttaacag 


1560 


tcaaagacaa 


tattcttgtc 


tctgggaatg 


1620 


aaacaggaca 


gtgtttacaa 


acattgcaag 


1680 


gtttacagtt 


caacaagaac 


tttgtaatta 


1740 


gggacttgaa 


aacgggtgaa 


tttattcgaa 


1800 


ggggagttgt 


gtggcggatc 


agagcctcaa 


1860 


ggaatgggac 


tgaagaaacc 


aagctgctgg 


1920 


gagcagaaaa 


gatgaatttg 


tccaattgtg 


1980 


caaaaagaaa 


aaaagaaaag 


aaaaagaaaa 


2040 


tggcttgggg 


caacagattg 


aaaagaccta 


2100 


caaaccataa 


ctgacaagag 


aggcgtctgc 


2160 


actgagggca 


gctttgcaaa 


atgagacttt 


2220 


attttcttct 


ccagtggtca 


ttggggcagt 


2280 


ctagcctgtt 


cttttaccac 


tgacagctag 


2340 


aacaagtact 


ggttgacttt 


ctaattagag 


2400 


agtggaaaag 


cttaaaaaaa 


ttactgtgaa 


2460 


ttttttttat 


tttttngcca 


accattgcca 


2520 


taatctattt 


actgttgctt 


ccatatacat 


2580 


aagttgtcct 


gggttctgtg 


agtcctgaga 


2640 


gtaggtcttc 


aaatatggga 


ttgttgtccc 


2700 


ttatttatgc 


tgctaaatga 


aagaaagaaa 


2760 


gctgtgacgt 


tttagtccca 


gactgaattc 


2820 



caaatttgct 


ctagtttggt 


tatggaaaaa 


tgtgcctcta 


agaggctgag 


aatggaagag 


caagtaaaga 


attgagagtg 


tgtgcaaagc 


acacattcct 


tggaacagag 


ctattacttg 


gggctgtggt 


gaatggatga 


acgtacatcg 


aaaacacaaa 


attaaaataa 


agttgctggt 


acaactgtta 


cagttttatt 


gctctgagta 


tagtaaaggc 


gtcacatgca 


aacaaacaaa 


ttctccaaga 


gccacaactc 


aagctgaact 


gatctttttt 


cctccttctg 


tttatttttt 


caatcagatt 


caagttggtt 


aattttagtt 


aacctttaaa 


gggattgtgt 


ctatggtttg 


ttaagtgcaa 


taaaatgtgt 


tttttcatgt 



agactttttg ccactgaaac ttgagccatc 2880 
tttcagataa taaagagtga agtttgcctg 2 94 0 
ttattttctt ttatctgggc aaaaattaaa 3000 
cctgttctgt ggagaaactt ttctttttga 3 060 
taaaactgac aaaatatttt aaaaatatat 312 0 
cagtcttagt gttttacagt atttgggaaa 318 0 
actgacaaag cagaaactat tcagtttttg 324 0 
atgaatgaaa cagtcaaatg gtttgcctca 33 00 
gtgaaagtgg tttaacactg tatcctaggc 33 6 0 
tgnttgtttt atttatagtc tgatttaaaa 342 0 
atgtaacaac ctgacatgat ggaggaaaac 3480 
attcacttag aaattttatt ttcttataac 3540 
t 3571 



<210> 3 
<211> 627 
<212> PRT 

<213> Homo sapiens 



<400> 3 

Met Cys Val Pro Arg Ser Gly Leu lie Leu Ser Cys lie Cys Leu Tyr 
15 10 15 



Cys Gly Val Leu 
20 

Cys Leu Ser Met 
35 



Leu Pro Val Leu 

Ser Thr Leu Glu 
40 



Leu Pro Asn Leu 
25 

Ser Val Thr Tyr 



Pro Phe Leu Thr 
30 

Leu Pro Glu Lys 
45 



Gly Leu Tyr Cys Gin Arg Leu Pro Ser Ser Arg Thr His Gly Gly Thr 
50 55 60 



Glu Ser Leu Lys Gly Lys Asn Thr Glu Asn Met Gly Phe Tyr Gly Thr 



Leu Lys Met lie Phe Tyr Lys Met Lys Arg Lys Leu Asp His Gly Ser 
85 90 95 



Glu Val Arg Ser 
100 

Tyr Thr Ser Thr 
115 

Phe Gly Asp Leu 
130 

He Thr Ser Val 
145 

Phe Gin Ser Trp 

He Asp Ser Cys 
180 

Glu Pro Gin Phe 
195 

Ala Leu Tyr Val 
210 

Ala Gin Thr Cys 
225 

Trp Arg Glu Lys 
Lys Arg Arg Lys 



Phe Ser Leu Gly 

Thr Gly Leu Val 
120 

Arg Ala Ala Asn 
135 

Gin Pro Pro Thr 

150 

Ser Gly Pro Glu 
165 

Glu Pro Thr Gin 



Gin Arg Asp Phe 
200 

Leu Ser Phe Leu 
215 

Arg Tyr Trp Arg 

230 

Cys Lys Glu Glu 
245 

Val He Lys Pro 



Lys Lys Pro Cys 
105 

Pro Cys Ser Ala 

Gly Gin Gly Gin 
140 

Gly Leu Gin Glu 
155 

Lys Leu Leu Ala 
170 

Val Lys His Met 
185 

He Ser Leu Leu 



Glu Pro Lys Asp 
220 

He Leu Ala Glu 
235 

Gly He Asp Glu 
250 

Gly Phe He His 



Lys Val Ser Glu 
110 

Thr Pro Thr Thr 
125 

Gin Arg Arg Arg 

Trp Leu Lys Met 
160 

Leu Asp Glu Leu 
175 

Met Gin Val He 
190 

Pro Lys Glu Leu 
205 

Leu Leu Gin Ala 



Asp Asn Leu Leu 
240 

Pro Leu His He 
255 

Ser Pro Trp Lys 



Ser Ala Tyr lie Arg Gin His Arg lie Asp Thr Asn Trp Arg Arg Gly 
275 280 285 



Glu Leu Lys Ser Pro Lys Val Leu Lys Gly His Asp Asp His Val lie 
290 295 300 



Thr Cys Leu Gin Phe Cys Gly Asn Arg lie Val Ser Gly Ser Asp Asp 
305 310 315 320 



Asn Thr Leu Lys Val Trp Ser Ala Val Thr Gly Lys Cys Leu Arg Thr 
325 330 335 



Leu Val Gly His Thr Gly Gly Val Trp Ser Ser Gin Met Arg Asp Asn 
340 345 350 



lie lie lie Ser Gly Ser Thr Asp Arg Thr Leu Lys Val Trp Asn Ala 
355 360 365 



Glu Thr Gly Glu Cys lie His Thr Leu Tyr Gly His Thr Ser Thr Val 
370 375 380 



Arg Cys Met His Leu His Glu Lys Arg Val Val Ser Gly Ser Arg Asp 
385 390 395 400 



Ala Thr Leu Arg Val Trp Asp lie Glu Thr Gly Gin Cys Leu His Val 
405 410 415 



Leu Met Gly His Val Ala Ala Val Arg Cys Val Gin Tyr Asp Gly Arg 
420 425 430 



Arg Val Val Ser Gly Ala Tyr Asp Phe Met Val Lys Val Trp Asp Pro 
435 440 445 



7 




Glu Thr Glu Thr Cys 
450 

Tyr Ser Leu Gin Phe 
465 

Thr Ser lie Arg Val 
485 

Leu Thr Gly His Gin 
500 

He Leu Val Ser Gly 
515 

Lys Thr Gly Gin Cys 
530 

Ser Ala Val Thr Cys 
545 

Ser Asp Asp Gly Thr 
565 

He Arg Asn Leu Val 
580 

Trp Arg He Arg Ala 
595 

Arg Asn Gly Thr Glu 
610 



Leu His Thr Leu Gin Gly 
455 

Asp Gly He His Val Val 
470 475 

Trp Asp Val Glu Thr Gly 
490 

Ser Leu Thr Ser Gly Met 
505 

Asn Ala Asp Ser Thr Val 
520 

Leu Gin Thr Leu Gin Gly 
535 

Leu Gin Phe Asn Lys Asn 
550 555 

Val Lys Leu Trp Asp Leu 
570 

Thr Leu Glu Ser Gly Gly 
585 

Ser Asn Thr Lys Leu Val 
600 

Glu Thr Lys Leu Leu Val 
615 




His Thr Asn Arg Val 
460 

Ser Gly Ser Leu Asp 
480 

Asn Cys He His Thr 
495 

Glu Leu Lys Asp Asn 
510 

Lys He Trp Asp He 
525 

Pro Asn Lys His Gin 
540 

Phe Val He Thr Ser 
560 

Lys Thr Gly Glu Phe 
575 

Ser Gly Gly Val Val 
590 

Cys Ala Val Gly Ser 
605 

Leu Asp Phe Asp Val 
620 



Asp Met Lys 




625 

<210> 4 

<211> 592 

<212> PRT 

<213> Homo sapiens 

<400> 4 

Met Ser Thr Leu Glu Ser Val Thr Tyr Leu Pro Glu Lys Gly Leu Tyr 
15 10 15 

Cys Gin Arg Leu Pro Ser Ser Arg Thr His Gly Gly Thr Glu Ser Leu 
20 25 30 

Lys Gly Lys Asn Thr Glu Asn Met Gly Phe Tyr Gly Thr Leu Lys Met 
35 40 45 

lie Phe Tyr Lys Met Lys Arg Lys Leu Asp His Gly Ser Glu Val Arg 
50 55 60 

Ser Phe Ser Leu Gly Lys Lys Pro Cys Lys Val Ser Glu Tyr Thr Ser 
65 70 75 80 

Thr Thr Gly Leu Val Pro Cys Ser Ala Thr Pro Thr Thr Phe Gly Asp 
85 90 95 

Leu Arg Ala Ala Asn Gly Gin Gly Gin Gin Arg Arg Arg lie Thr Ser 
100 105 110 

Val Gin Pro Pro Thr Gly Leu Gin Glu Trp Leu Lys Met Phe Gin Ser 
115 120 125 

Trp Ser Gly Pro Glu Lys Leu Leu Ala Leu Asp Glu Leu lie Asp Ser 
130 135 140 

9 



Cys Glu Pro Thr Gin Val Lys His Met Met Gin Val lie Glu Pro Gin 
145 150 155 160 



Phe Gin Arg Asp Phe lie Ser Leu Leu Pro Lys Glu Leu Ala Leu Tyr 
165 170 175 



Val Leu Ser Phe Leu Glu Pro Lys Asp Leu Leu Gin Ala Ala Gin Thr 
180 185 190 



Cys Arg Tyr Trp Arg lie Leu Ala Glu Asp Asn Leu Leu Trp Arg Glu 
195 200 205 



Lys Cys Lys Glu Glu Gly lie Asp Glu Pro Leu His lie Lys Arg Arg 
210 215 220 



Lys Val lie Lys Pro Gly Phe lie His Ser Pro Trp Lys Ser Ala Tyr 
225 230 235 240 



lie Arg Gin His Arg lie Asp Thr Asn Trp Arg Arg Gly Glu Leu Lys 
245 250 255 



Ser Pro Lys Val Leu Lys Gly His Asp Asp His Val lie Thr Cys Leu 
260 265 270 



Gin Phe Cys Gly Asn Arg lie Val Ser Gly Ser Asp Asp Asn Thr Leu 
275 280 285 



Lys Val Trp Ser Ala Val Thr Gly Lys Cys Leu Arg Thr Leu Val Gly 
290 295 300 



His Thr Gly Gly Val Trp Ser Ser Gin Met Arg Asp Asn He He He 
305 310 315 320 



10 



Ser Gly Ser Thr Asp Arg Thr Leu Lys Val Trp Asn Ala Glu Thr Gly 
325 330 335 



Glu Cys lie His Thr Leu Tyr Gly His Thr Ser Thr Val Arg Cys Met 
340 345 350 



His Leu His Glu Lys Arg Val Val Ser Gly Ser Arg Asp Ala Thr Leu 
355 360 365 



Arg Val Trp Asp lie Glu Thr Gly Gin Cys Leu His Val Leu Met Gly 
370 375 380 



His Val Ala Ala Val Arg Cys Val Gin Tyr Asp Gly Arg Arg Val Val 
385 390 395 400 



Ser Gly Ala Tyr Asp Phe Met Val Lys Val Trp Asp Pro Glu Thr Glu 
405 410 415 



Thr Cys Leu His Thr Leu Gin Gly His Thr Asn Arg Val Tyr Ser Leu 
420 425 430 



Gin Phe Asp Gly lie His Val Val Ser Gly Ser Leu Asp Thr Ser He 
435 440 445 



Arg Val Trp Asp Val Glu Thr Gly Asn Cys He His Thr Leu Thr Gly 
450 455 460 



His Gin Ser Leu Thr Ser Gly Met Glu Leu Lys Asp Asn He Leu Val 
465 470 475 480 



Ser Gly Asn Ala Asp Ser Thr Val Lys He Trp Asp He Lys Thr Gly 
485 490 495 



Gin Cys Leu Gin Thr Leu Gin Gly Pro Asn Lys His Gin Ser Ala Val 
500 505 510 



11 



Thr Cys Leu Gin 

515 

Gly Thr Val Lys 
530 

Leu Val Thr Leu 
545 

Arg Ala Ser Asn 



Thr Glu Glu Thr 
580 



Phe Asn Lys Asn 
520 

Leu Trp Asp Leu 
535 

Glu Ser Gly Gly 
550 

Thr Lys Leu Val 
565 

Lys Leu Leu Val 



Phe Val lie Thr 



Lys Thr Gly Glu 
540 

Ser Gly Gly Val 

555 

Cys Ala Val Gly 
570 

Leu Asp Phe Asp 
585 



Ser Ser Asp Asp 
525 

Phe lie Arg Asn 



Val Trp Arg lie 
560 

Ser Arg Asn Gly 
575 

Val Asp Met Lys 
590 



<210> 5 
<211> 553 
<212> PRT 

<213> Homo sapiens 
<400> 5 

Met Gly Phe Tyr Gly Thr Leu Lys Met lie Phe Tyr Lys Met Lys Arg 
15 10 15 

Lys Leu Asp His Gly Ser Glu Val Arg Ser Phe Ser Leu Gly Lys Lys 
20 25 30 



Pro Cys Lys Val Ser Glu Tyr Thr Ser Thr Thr Gly Leu Val Pro Cys 



Ser Ala Thr Pro Thr Thr Phe Gly Asp Leu Arg Ala Ala Asn Gly Gin 
50 55 60 



Gly Gin Gin Arg 
65 

Gin Glu Trp Leu 

Leu Ala Leu Asp 
100 

His Met Met Gin 
115 

Leu Leu Pro Lys 
13 0 

Lys Asp Leu Leu 
145 

Ala Glu Asp Asn 

Asp Glu Pro Leu 
180 

lie His Ser Pro 
195 

Thr Asn Trp Arg 
210 

His Asp Asp His 



Arg Arg lie Thr 
70 

Lys Met Phe Gin 
85 

Glu Leu lie Asp 

Val lie Glu Pro 
120 

Glu Leu Ala Leu 
135 

Gin Ala Ala Gin 
150 

Leu Leu Trp Arg 
165 

His lie Lys Arg 

Trp Lys Ser Ala 
200 

Arg Gly Glu Leu 
215 

Val lie Thr Cys 



Ser Val Gin Pro 
75 

Ser Trp Ser Gly 
90 

Ser Cys Glu Pro 
105 

Gin Phe Gin Arg 

Tyr Val Leu Ser 
140 

Thr Cys Arg Tyr 
155 

Glu Lys Cys Lys 
170 

Arg Lys Val lie 

185 

Tyr lie Arg Gin 

Lys Ser Pro Lys 
220 

Leu Gin Phe Cys 



Pro Thr Gly Leu 
80 

Pro Glu Lys Leu 
95 

Thr Gin Val Lys 
110 

Asp Phe lie Ser 
125 

Phe Leu Glu Pro 



Trp Arg lie Leu 
160 

Glu Glu Gly lie 
175 

Lys Pro Gly Phe 
190 

His Arg lie Asp 
205 

Val Leu Lys Gly 
Gly Asn Arg lie 



13 



Val Ser Gly Ser Asp Asp Asn Thr Leu Lys Val Trp Ser Ala Val Thr 
245 250 255 



Gly Lys Cys Leu Arg Thr Leu Val Gly His Thr Gly Gly Val Trp Ser 
260 265 270 



Ser Gin Met Arg Asp Asn lie lie lie Ser Gly Ser Thr Asp Arg Thr 
275 280 285 



Leu Lys Val Trp Asn Ala Glu Thr Gly Glu Cys lie His Thr Leu Tyr 
290 295 300 



Gly His Thr Ser Thr Val Arg Cys Met His Leu His Glu Lys Arg Val 
305 310 315 320 



Val Ser Gly Ser Arg Asp Ala Thr Leu Arg Val Trp Asp lie Glu Thr 
325 330 335 



Gly Gin Cys Leu His Val Leu Met Gly His Val Ala Ala Val Arg Cys 
340 345 350 



Val Gin Tyr Asp Gly Arg Arg Val Val Ser Gly Ala Tyr Asp Phe Met 
355 360 365 



Val Lys Val Trp Asp Pro Glu Thr Glu Thr Cys Leu His Thr Leu Gin 
370 375 380 



Gly His Thr Asn Arg Val Tyr Ser Leu Gin Phe Asp Gly He His Val 
385 390 395 400 



Val Ser Gly Ser Leu Asp Thr Ser He Arg Val Trp Asp Val Glu Thr 
405 410 415 



14 




Gly Asn Cys lie His Thr Leu Thr Gly His Gin Ser Leu Thr Ser Gly 
420 425 430 

Met Glu Leu Lys Asp Asn He Leu Val Ser Gly Asn Ala Asp Ser Thr 
435 440 445 

Val Lys He Trp Asp He Lys Thr Gly Gin Cys Leu Gin Thr Leu Gin 
450 455 460 

Gly Pro Asn Lys His Gin Ser Ala Val Thr Cys Leu Gin Phe Asn Lys 
465 470 475 480 

Asn Phe Val He Thr Ser Ser Asp Asp Gly Thr Val Lys Leu Trp Asp 
485 490 495 

Leu Lys Thr Gly Glu Phe He Arg Asn Leu Val Thr Leu Glu Ser Gly 
500 505 510 

Gly Ser Gly Gly Val Val Trp Arg He Arg Ala Ser Asn Thr Lys Leu 
515 520 525 

Val Cys Ala Val Gly Ser Arg Asn Gly Thr Glu Glu Thr Lys Leu Leu 
530 535 540 

Val Leu Asp Phe Asp Val Asp Met Lys 
545 550 

<210> 6 
<211> 545 
<212> PRT 

<213> Homo sapiens 
<400> 6 

15 



Met lie Phe Tyr Lys Met Lys Arg Lys Leu Asp His Gly Ser Glu Val 



Arg Ser Phe Ser Leu Gly Lys Lys Pro Cys Lys Val Ser Glu Tyr Thr 
20 25 30 



Ser Thr Thr Gly Leu Val Pro Cys Ser Ala Thr Pro Thr Thr Phe Gly 



Asp Leu Arg Ala Ala Asn Gly Gin Gly Gin Gin Arg Arg Arg lie Thr 



Ser Val Gin Pro Pro Thr Gly Leu Gin Glu Trp Leu Lys Met Phe Gin 



Ser Trp Ser Gly Pro Glu Lys Leu Leu Ala Leu Asp Glu Leu lie Asp 



Ser Cys Glu Pro Thr Gin Val Lys His Met Met Gin Val lie Glu Pro 
100 105 110 



Gin Phe Gin Arg Asp Phe He Ser Leu Leu Pro Lys Glu Leu Ala Leu 
115 120 125 



Tyr Val Leu Ser Phe Leu Glu Pro Lys Asp Leu Leu Gin Ala Ala Gin 
130 135 140 



Thr Cys Arg Tyr Trp Arg lie Leu Ala Glu Asp Asn Leu Leu Trp Arg 
145 150 155 160 



Glu Lys Cys Lys Glu Glu Gly He Asp Glu Pro Leu His lie Lys Arg 
165 170 175 



Arg Lys Val He Lys Pro Gly Phe He His Ser Pro Trp Lys Ser Ala 



16 



Tyr He Arg Gin His Arg He Asp Thr Asn Trp Arg Arg Gly Glu Leu 
195 200 205 



Lys Ser Pro Lys Val Leu Lys Gly His Asp Asp His Val He Thr Cys 
210 215 220 



Leu Gin Phe Cys Gly Asn Arg He Val Ser Gly Ser Asp Asp Asn Thr 
225 230 235 240 



Leu Lys Val Trp Ser Ala Val Thr Gly Lys Cys Leu Arg Thr Leu Val 
245 250 255 



Gly His Thr Gly Gly Val Trp Ser Ser Gin Met Arg Asp Asn lie He 
260 265 270 



He Ser Gly Ser Thr Asp Arg Thr Leu Lys Val Trp Asn Ala Glu Thr 
275 280 285 



Gly Glu Cys He His Thr Leu Tyr Gly His Thr Ser Thr Val Arg Cys 
290 295 300 



Met His Leu His Glu Lys Arg Val Val Ser Gly Ser Arg Asp Ala Thr 
305 310 315 320 



Leu Arg Val Trp Asp He Glu Thr Gly Gin Cys Leu His Val Leu Met 
325 330 335 



Gly His Val Ala Ala Val Arg Cys Val Gin Tyr Asp Gly Arg Arg Val 
340 345 350 



Val Ser Gly Ala Tyr Asp Phe Met Val Lys Val Trp Asp Pro Glu Thr 
355 360 365 



17 



Glu Thr Cys Leu His 
370 

Leu Gin Phe Asp Gly 
385 

lie Arg Val Trp Asp 
405 

Gly His Gin Ser Leu 
420 

Val Ser Gly Asn Ala 
435 

Gly Gin Cys Leu Gin 
450 

Val Thr Cys Leu Gin 
465 

Asp Gly Thr Val Lys 
485 

Asn Leu Val Thr Leu 
500 

lie Arg Ala Ser Asn 
515 

Gly Thr Glu Glu Thr 
530 



Thr Leu Gin Gly His Thr 
375 

lie His Val Val Ser Gly 
390 395 

Val Glu Thr Gly Asn Cys 
. 410 

Thr Ser Gly Met Glu Leu 
425 

Asp Ser Thr Val Lys lie 
440 

Thr Leu Gin Gly Pro Asn 
455 

Phe Asn Lys Asn Phe Val 
470 475 

Leu Trp Asp Leu Lys Thr 
490 

Glu Ser Gly Gly Ser Gly 
505 

Thr Lys Leu Val Cys Ala 
520 

Lys Leu Leu Val Leu Asp 
535 



Asn Arg Val Tyr Ser 
380 

Ser Leu Asp Thr Ser 
400 

lie His Thr Leu Thr 
415 

Lys Asp Asn lie Leu 
430 

Trp Asp lie Lys Thr 
445 

Lys His Gin Ser Ala 
460 

lie Thr Ser Ser Asp 
480 

Gly Glu Phe lie Arg 
495 

Gly Val Val Trp Arg 
510 

Val Gly Ser Arg Asn 
525 

Phe Asp Val Asp Met 
540 



Lys 
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545 

<210> 7 

<211> 540 

<212> PRT 

<213> Homo sapiens 

<400> 7 

Met Lys Arg Lys Leu Asp His Gly Ser Glu Val Arg Ser Phe Ser Leu 
15 10 15 

Gly Lys Lys Pro Cys Lys Val Ser Glu Tyr Thr Ser Thr Thr Gly Leu 
20 25 30 

Val Pro Cys Ser Ala Thr Pro Thr Thr Phe Gly Asp Leu Arg Ala Ala 
35 40 45 

Asn Gly Gin Gly Gin Gin Arg Arg Arg lie Thr Ser Val Gin Pro Pro 
50 55 60 

Thr Gly Leu Gin Glu Trp Leu Lys Met Phe Gin Ser Trp Ser Gly Pro 
65 70 75 80 

Glu Lys Leu Leu Ala Leu Asp Glu Leu lie Asp Ser Cys Glu Pro Thr 
85 90 95 

Gin Val Lys His Met Met Gin Val He Glu Pro Gin Phe Gin Arg Asp 
100 105 110 

Phe He Ser Leu Leu Pro Lys Glu Leu Ala Leu Tyr Val Leu Ser Phe 
115 120 125 



Leu Glu Pro Lys Asp Leu Leu Gin Ala Ala Gin Thr Cys Arg Tyr Trp 



Arg lie Leu Ala Glu Asp Asn Leu Leu Trp Arg Glu Lys Cys Lys Glu 
145 150 155 160 



Glu Gly lie Asp Glu Pro Leu His lie Lys Arg Arg Lys Val lie Lys 
165 170 175 



Pro Gly Phe lie His Ser Pro Trp Lys Ser Ala Tyr lie Arg Gin His 
180 185 190 



Arg lie Asp Thr Asn Trp Arg Arg Gly Glu Leu Lys Ser Pro Lys Val 
195 200 205 



Leu Lys Gly His Asp Asp His Val lie Thr Cys Leu Gin Phe Cys Gly 
210 215 220 



Asn Arg lie Val Ser Gly Ser Asp Asp Asn Thr Leu Lys Val Trp Ser 
225 230 235 240 



Ala Val Thr Gly Lys Cys Leu Arg Thr Leu Val Gly His Thr Gly Gly 
245 250 255 



Val Trp Ser Ser Gin Met Arg Asp Asn lie lie lie Ser Gly Ser Thr 
260 265 270 



Asp Arg Thr Leu Lys Val Trp Asn Ala Glu Thr Gly Glu Cys lie His 
275 280 285 



Thr Leu Tyr Gly His Thr Ser Thr Val Arg Cys Met His Leu His Glu 
290 295 300 



Lys Arg Val Val Ser Gly Ser Arg Asp Ala Thr Leu Arg Val Trp Asp 
305 310 315 320 
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lie Glu Thr Gly Gin Cys Leu His Val Leu Met Gly His Val Ala Ala 
325 330 335 

Val Arg Cys Val Gin Tyr Asp Gly Arg Arg Val Val Ser Gly Ala Tyr 
340 345 350 

Asp Phe Met Val Lys Val Trp Asp Pro Glu Thr Glu Thr Cys Leu His 
355 360 365 

Thr Leu Gin Gly His Thr Asn Arg Val Tyr Ser Leu Gin Phe Asp Gly 
370 375 380 

lie His Val Val Ser Gly Ser Leu Asp Thr Ser lie Arg Val Trp Asp 
385 390 395 400 

Val Glu Thr Gly Asn Cys lie His Thr Leu Thr Gly His Gin Ser Leu 
405 410 415 

Thr Ser Gly Met Glu Leu Lys Asp Asn lie Leu Val Ser Gly Asn Ala 
420 425 430 

Asp Ser Thr Val Lys lie Trp Asp lie Lys Thr Gly Gin Cys Leu Gin 
435 440 445 

Thr Leu Gin Gly Pro Asn Lys His Gin Ser Ala Val Thr Cys Leu Gin 
450 455 460 

Phe Asn Lys Asn Phe Val lie Thr Ser Ser Asp Asp Gly Thr Val Lys 
465 470 475 480 

Leu Trp Asp Leu Lys Thr Gly Glu Phe lie Arg Asn Leu Val Thr Leu 
485 490 495 

Glu Ser Gly Gly Ser Gly Gly Val Val Trp Arg lie Arg Ala Ser Asn 
500 505 510 
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Thr Lys Leu Val Cys Ala Val Gly Ser Arg Asn Gly Thr Glu Glu Thr 
515 520 525 



Lys Leu Leu Val Leu Asp Phe Asp Val Asp Met Lys 
530 535 540 

<210> 8 

<211> 589 

<212> PRT 

<213> Homo sapiens 

<400> 8 

Met Ser Lys Pro Gly Lys Pro Thr Leu Asn His Gly Leu Val Pro Val 
15 10 15 

Asp Leu Lys Ser Ala Lys Glu Pro Leu Pro His Gin Thr Val Met Lys 
20 25 30 

lie Phe Ser lie Ser lie lie Ala Gin Gly Leu Pro Phe Cys Arg Arg 
35 40 45 

Arg Met Lys Arg Lys Leu Asp His Gly Ser Glu Val Arg Ser Phe Ser 
50 55 60 

Leu Gly Lys Lys Pro Cys Lys Val Ser Glu Tyr Thr Ser Thr Thr Gly 
65 70 75 80 

Leu Val Pro Cys Ser Ala Thr Pro Thr Thr Phe Gly Asp Leu Arg Ala 
85 90 95 



Ala Asn Gly Gin Gly Gin Gin Arg Arg Arg lie Thr Ser Val Gin Pro 



Pro Thr Gly Leu Gin Glu Trp Leu Lys Met Phe Gin Ser Trp Ser Gly 
115 120 125 



Pro Glu Lys Leu Leu Ala Leu Asp Glu Leu lie Asp Ser Cys Glu Pro 
130 135 140 



Thr Gin Val Lys His Met Met Gin Val lie Glu Pro Gin Phe Gin Arg 
145 150 155 160 



Asp Phe lie Ser Leu Leu Pro Lys Glu Leu Ala Leu Tyr Val Leu Ser 
165 170 175 



Phe Leu Glu Pro Lys Asp Leu Leu Gin Ala Ala Gin Thr Cys Arg Tyr 
180 185 190 



Trp Arg He Leu Ala Glu Asp Asn Leu Leu Trp Arg Glu Lys Cys Lys 
195 200 205 



Glu Glu Gly He Asp Glu Pro Leu His He Lys Arg Arg Lys Val He 
210 215 220 



Lys Pro Gly Phe He His Ser Pro Trp Lys Ser Ala Tyr He Arg Gin 
225 230 235 240 



His Arg He Asp Thr Asn Trp Arg Arg Gly Glu Leu Lys Ser Pro Lys 
245 250 255 



Val Leu Lys Gly His Asp Asp His Val He Thr Cys Leu Gin Phe Cys 
260 265 270 



Gly Asn Arg He Val Ser Gly Ser Asp Asp Asn Thr Leu Lys Val Trp 
275 280 285 



Ser Ala Val Thr Gly Lys Cys Leu Arg Thr Leu Val Gly His Thr Gly 
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Gly Val Trp Ser Ser Gin Met Arg Asp Asn lie lie lie Ser Gly Ser 
305 310 315 320 



Thr Asp Arg Thr Leu Lys Val Trp Asn Ala Glu Thr Gly Glu Cys lie 
325 330 335 



His Thr Leu Tyr Gly His Thr Ser Thr Val Arg Cys Met His Leu His 
340 345 350 



Glu Lys Arg Val Val Ser Gly Ser Arg Asp Ala Thr Leu Arg Val Trp 
355 360 365 



Asp lie Glu Thr Gly Gin Cys Leu His Val Leu Met Gly His Val Ala 
370 375 380 



Ala Val Arg Cys Val Gin Tyr Asp Gly Arg Arg Val Val Ser Gly Ala 
385 390 395 400 



Tyr Asp Phe Met Val Lys Val Trp Asp Pro Glu Thr Glu Thr Cys Leu 
405 410 415 



His Thr Leu Gin Gly His Thr Asn Arg Val Tyr Ser Leu Gin Phe Asp 
420 425 430 



Gly lie His Val Val Ser Gly Ser Leu Asp Thr Ser lie Arg Val Trp 
435 440 445 



Asp Val Glu Thr Gly Asn Cys lie His Thr Leu Thr Gly His Gin Ser 
450 455 460 



Leu Thr Ser Gly Met Glu Leu Lys Asp Asn lie Leu Val Ser Gly Asn 
465 470 475 480 
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Ala Asp Ser Thr Val Lys lie Trp Asp He Lys Thr Gly Gin Cys Leu 
485 490 495 

Gin Thr Leu Gin Gly Pro Asn Lys His Gin Ser Ala Val Thr Cys Leu 
500 505 510 

Gin Phe Asn Lys Asn Phe Val He Thr Ser Ser Asp Asp Gly Thr Val 
515 520 525 

Lys Leu Trp Asp Leu Lys Thr Gly Glu Phe He Arg Asn Leu Val Thr 
530 535 540 

Leu Glu Ser Gly Gly Ser Gly Gly Val Val Trp Arg He Arg Ala Ser 
545 550 555 560 

Asn Thr Lys Leu Val Cys Ala Val Gly Ser Arg Asn Gly Thr Glu Glu 
565 570 575 

Thr Lys Leu Leu Val Leu Asp Phe Asp Val Asp Met Lys 
580 585 

<210> 9 

<211> 559 

<212> PRT 

<213> Homo sapiens 

<400> 9 

Met Lys He Phe Ser He Ser He He Ala Gin Gly Leu Pro Phe Cys 
15 10 15 

Arg Arg Arg Met Lys Arg Lys Leu Asp His Gly Ser Glu Val Arg Ser 
20 25 30 



25 



Phe Ser Leu Gly Lys Lys Pro Cys Lys Val Ser Glu Tyr Thr Ser Thr 
35 40 45 



Thr Gly Leu Val Pro Cys Ser Ala Thr Pro Thr Thr Phe Gly Asp Leu 



Arg Ala Ala Asn Gly Gin Gly Gin Gin Arg Arg Arg lie Thr Ser Val 
65 70 75 80 



Gin Pro Pro Thr Gly Leu Gin Glu Trp Leu Lys Met Phe Gin Ser Trp 



Ser Gly Pro Glu Lys Leu Leu Ala Leu Asp Glu Leu lie Asp Ser Cys 
100 105 110 



Glu Pro Thr Gin Val Lys His Met Met Gin Val lie Glu Pro Gin Phe 
115 120 125 



Gin Arg Asp Phe lie Ser Leu Leu Pro Lys Glu Leu Ala Leu Tyr Val 
130 135 140 



Leu Ser Phe Leu Glu Pro Lys Asp Leu Leu Gin Ala Ala Gin Thr Cys 
145 150 155 160 



Arg Tyr Trp Arg lie Leu Ala Glu Asp Asn Leu Leu Trp Arg Glu Lys 
165 170 175 



Cys Lys Glu Glu Gly lie Asp Glu Pro Leu His lie Lys Arg Arg Lys 
180 185 190 



Val lie Lys Pro Gly Phe lie His Ser Pro Trp Lys Ser Ala Tyr lie 
195 200 205 



Arg Gin His Arg lie Asp Thr Asn Trp Arg Arg Gly Glu Leu Lys Ser 
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Pro Lys Val Leu Lys Gly His Asp Asp His Val lie Thr Cys Leu Gin 
225 230 235 240 



Phe Cys Gly Asn Arg lis Val Ser Gly Ser Asp Asp Asn Thr Leu Lys 
245 250 255 



Val Trp Ser Ala Val Thr Gly Lys Cys Leu Arg Thr Leu Val Gly His 
260 265 270 



Thr Gly Gly Val Trp Ser Ser Gin Met Arg Asp Asn lie lie lie Ser 
275 280 285 



Gly Ser Thr Asp Arg Thr Leu Lys Val Trp Asn Ala Glu Thr Gly Glu 
290 295 300 



Cys lie His Thr Leu Tyr Gly His Thr Ser Thr Val Arg Cys Met His 
305 310 315 320 



Leu His Glu Lys Arg Val Val Ser Gly Ser Arg Asp Ala Thr Leu Arg 
325 330 335 



Val Trp Asp lie Glu Thr Gly Gin Cys Leu His Val Leu Met Gly His 
340 345 350 



Val Ala Ala Val Arg Cys Val Gin Tyr Asp Gly Arg Arg Val Val Ser 
355 360 365 



Gly Ala Tyr Asp Phe Met Val Lys Val Trp Asp Pro Glu Thr Glu Thr 
370 375 380 



Cys Leu His Thr Leu Gin Gly His Thr Asn Arg Val Tyr Ser Leu Gin 
385 390 395 400 
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Phe Asp Gly lie 

Val Trp Asp Val 
420 

Gin Ser Leu Thr 
435 

Gly Asn Ala Asp 
450 

Cys Leu Gin Thr 
465 

Cys Leu Gin Phe 

Thr Val Lys Leu 
500 

Val Thr Leu Glu 
515 

Ala Ser Asn Thr 
530 

Glu Glu Thr Lys 
545 




His Val Val Ser 
405 

Glu Thr Gly Asn 

Ser Gly Met Glu 
440 

Ser Thr Val Lys 
455 

Leu Gin Gly Pro 
470 

Asn Lys Asn Phe 
485 

Trp Asp Leu Lys 

Ser Gly Gly Ser 
520 

Lys Leu Val Cys 
535 

Leu Leu Val Leu 
550 



Gly Ser Leu Asp 
410 

Cys lie His Thr 
425 

Leu Lys Asp Asn 

lie Trp Asp lie 
460 

Asn Lys His Gin 
475 

Val lie Thr Ser 
490 

Thr Gly Glu Phe 
505 

Gly Gly Val Val 

Ala Val Gly Ser 
540 

Asp Phe Asp Val 
555 




Thr Ser lie Arg 
415 

Leu Thr Gly His 
430 

lie Leu Val Ser 
445 

Lys Thr Gly Gin 

Ser Ala Val Thr 
480 

Ser Asp Asp Gly 
495 

lie Arg Asn Leu 
510 

Trp Arg lie Arg 
525 

Arg Asn Gly Thr 
Asp Met Lys 



<210> 10 
<211> 540 
<212> PRT 
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<213> Homo sapiens 



<400> 10 

Met Lys Arg Lys Leu Asp His Gly Ser Glu Val Arg Ser Phe Ser Lei 



Gly Lys Lys Pro Cys Lys Val Ser Glu Tyr Thr Ser Thr Thr Gly Leu 



Val Pro Cys Ser Ala Thr Pro Thr Thr Phe Gly Asp Leu Arg Ala Ala 



Asn Gly Gin Gly Gin Gin Arg Arg Arg lie Thr Ser Val Gin Pro Pro 



Thr Gly Leu Gin Glu Trp Leu Lys Met Phe Gin Ser Trp Ser Gly Pro 



Glu Lys Leu Leu Ala Leu Asp Glu Leu lie Asp Ser Cys Glu Pro Thr 



Gin Val Lys His Met Met Gin Val lie Glu Pro Gin Phe Gin Arg Asp 
100 105 110 



Phe lie Ser Leu Leu Pro Lys Glu Leu Ala Leu Tyr Val Leu Ser Phe 
115 120 125 



Leu Glu Pro Lys Asp Leu Leu Gin Ala Ala Gin Thr Cys Arg Tyr Trp 
130 135 140 



Arg lie Leu Ala Glu Asp Asn Leu Leu Trp Arg Glu Lys Cys Lys Glu 
145 150 155 160 



Glu Gly lie Asp Glu Pro Leu His lie Lys Arg Arg Lys Val lie Lys 
165 170 175 
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Pro Gly Phe lie His Ser Pro Trp Lys Ser Ala Tyr lie Arg Gin His 
180 185 190 



Arg lie Asp Thr Asn Trp Arg Arg Gly Glu Leu Lys Ser Pro Lys Val 
195 200 205 



Leu Lys Gly His Asp Asp His Val lie Thr Cys Leu Gin Phe Cys Gly 
210 215 220 



Asn Arg lie Val Ser Gly Ser Asp Asp Asn Thr Leu Lys Val Trp Ser 
225 230 235 240 



Ala Val Thr Gly Lys Cys Leu Arg Thr Leu Val Gly His Thr Gly Gly 
245 250 255 



Val Trp Ser Ser Gin Met Arg Asp Asn lie He He Ser Gly Ser Thr 
260 265 270 



Asp Arg Thr Leu Lys Val Trp Asn Ala Glu Thr Gly Glu Cys He His 
275 280 285 



Thr Leu Tyr Gly His Thr Ser Thr Val Arg Cys Met His Leu His Glu 
290 295 300 



Lys Arg Val Val Ser Gly Ser Arg Asp Ala Thr Leu Arg Val Trp Asp 
305 310 315 • 320 



He Glu Thr Gly Gin Cys Leu His Val Leu Met Gly His Val Ala Ala 
325 330 335 



Val Arg Cys Val Gin Tyr Asp Gly Arg Arg Val Val Ser Gly Ala Tyr 
340 345 350 
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Asp Phe Met Val Lys Val Trp Asp Pro Glu Thr Glu Thr Cys Leu His 
355 360 365 

Thr Leu Gin Gly His Thr Asn Arg Val Tyr Ser Leu Gin Phe Asp Gly 
370 375 380 

lie His Val Val Ser Gly Ser Leu Asp Thr Ser lie Arg Val Trp Asp 
385 390 395 400 

Val Glu Thr Gly Asn Cys lie His Thr Leu Thr Gly His Gin Ser Leu 
405 410 415 

Thr Ser Gly Met Glu Leu Lys Asp Asn lie Leu Val Ser Gly Asn Ala 
420 425 430 

Asp Ser Thr Val Lys lie Trp Asp lie Lys Thr Gly Gin Cys Leu Gin 
435 440 445 

Thr Leu Gin Gly Pro Asn Lys His Gin Ser Ala Val Thr Cys Leu Gin 
450 455 460 

Phe Asn Lys Asn Phe Val lie Thr Ser Ser Asp Asp Gly Thr Val Lys 
465 470 475 480 

Leu Trp Asp Leu Lys Thr Gly Glu Phe lie Arg Asn Leu Val Thr Leu 
485 490 495 

Glu Ser Gly Gly Ser Gly Gly Val Val Trp Arg lie Arg Ala Ser Asn 
500 505 510 

Thr Lys Leu Val Cys Ala Val Gly Ser Arg Asn Gly Thr Glu Glu Thr 
515 520 525 

Lys Leu Leu Val Leu Asp Phe Asp Val Asp Met Lys 
530 535 540 
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<210> 11 
<211> 34 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
Oligonucleotide primer 

<400> 11 

cgggatccac catggatgat ggatcgatga cacc 

<210> 12 
<211> 33 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
Oligonucleotide primer 

<400> 12 

ggaattcctt aagggtatac agcatcaaag teg 

<210> 13 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
Oligonucleotide primer 
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<400> 13 

tcacttcatg tccacatcaa agtcc 

<210> 14 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
Oligonucleotide primer 

<400> 14 

ggtaattaca agttcttgtt gaactg 

<210> 15 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
Oligonucleotide primer 

<400> 15 

ccctgcaacg tgtgtagaca gg 

<210> 16 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
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Oligonucleotide primer 



<400> 16 



ccagtctctg cattccacac tttg 



24 



<210> 17 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
Oligonucleotide primer 

<400> 17 

ctcagacagg tcaggacatt tgg 23 

<210> 18 
<211> 33 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
Oligonucleotide primer 



<400> 18 



ggaattccat gaaaagattg gaccatggtt ctg 



33 



<210> 19 



<211> 34 



<212> DNA 



<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence: 
Oligonucleotide primer 

<400> 19 

ggaattcctc acttcatgtc acatcaaagt ccag 

<210> 20 
<211> 1881 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 6 myc tagged 
homo sapiens 



<400> 20 



atggagcaaa 


agctcatttc 


tgaagaggac 


ttgaatgaaa 


tggagcaaaa 


gctcatttct 


60 


gaagaggact 


tgaatgaaat 


ggagcaaaag 


ctcatttctg 


aagaggactt 


gaatgaaatg 


120 


gagcaaaagc 


tcatttctga 


agaggacttg 


aatgaaatgg 


agcaaaagct 


catttctgaa 


180 


gaggacttga 


atgaaatgga 


gagcttgggc 


gacctcacca 


tggagcaaaa 


gctcatttct 


240 


gaagaggact 


tgaattccat 


gaaaagaaag 


ttggaccatg 


gttctgaggt 


ccgctctttt 


300 


tctttgggaa 


agaaaccatg 


caaagtctca 


gaatatacaa 


gtaccactgg 


gcttgtacca 


360 


tgttcagcaa 


caccaacaac 


ttttggggac 


ctcagagcag 


ccaatggcca 


agggcaacaa 


420 


cgacgccgaa 


ttacatctgt 


ccagccacct 


acaggcctcc 


aggaatggct 


aaaaatgttt 


480 


cagagctgga 


gtggaccaga 


gaaattgctt 


gctttagatg 


aactcattga 


tagttgtgaa 


540 


ccaacacaag 


taaaacatat 


gatgcaagtg 


atagaacccc 


agtttcaacg 


agacttcatt 


600 


tcattgctcc 


ctaaagagtt 


ggcactctat 


gtgctttcat 


tcctggaacc 


caaagacctg 


660 


ctacaagcag 


ctcagacatg 


tcgctactgg 


agaattttgg 


ctgaagacaa 


ccttctctgg 


720 


agagagaaat 


gcaaagaaga 


ggggattgat 


gaaccattgc 


acatcaagag 


aagaaaagta 


780 


ataaaaccag 


gtttcataca 


cagtccatgg 


aaaagtgcat 


acatcagaca 


gcacagaatt 


840 


gatactaact 


ggaggcgagg 


agaactcaaa 


tctcctaagg 


tgctgaaagg 


acatgatgat 


900 


catgtgatca 


catgcttaca 


gttttgtggt 


aaccgaatag 


ttagtggttc 


tgatgacaac 


960 


actttaaaag 


tttggtcagc 


agtcacaggc 


aaatgtctga 


gaacattagt 


gggacataca 


1020 


ggtggagtat 


ggtcatcaca 


aatgagggac 


aacatcatca 


ttagtggatc 


tacagatcgg 


1080 
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• 






• 






a c a c t c aaag 


tgtggaatgc 


agaga c t gga 


Ullglgttg 




tgggcatac ^ 


1140 


tccactgtgc 


g t 1 g t a t g c a. 


tcttcatgaa 






tegagatgee 


1200 


ac t c t taggg 


1 1 tcfcfcfa.ta.fc 






atgttttgat 


gggtcatgtt 


1260 


gcagcagtcc 


cjc tgtgt tea 


atltgZggc 






atatgatttt 


1320 


atggtaaagg 


tgtgggatcc 








ggggcatac t 


1380 


aatagagtct 


at tcattaca 


gt t tgatggt 






tcttgataca 


1440 


tccatccgtg 


1 1 1 ggga tgt 


ggagacaggg 








1500 


tcgttaacaa 


g t ggaa t gga 


actcaaagac 


aatat fcc t tg 




tgeagattet 


1560 


acagt taaaa 


tc tgggatat 


c aaaac ag ga 






aggtcccaac 


1620 


aagcatcaga 


cj t g c t cj t ga c 


ctgtt ^ g 




actttgtaat 


tac cage tea 


1680 


g"a t ga t ggaa 


c tgtaaaact 


atgggacttg 




aatttattcg 


aaac c tagt c 


1740 


acattggaga 


gtggggggag 


tgggggagtt 


gtgtggcgga 




aaac acaaag 


1800 


ctggtgtgtg 


cagt tgggag 


tcggaatggg 




c caagc tgc t 


ggtgctggac 


1860 


tttgatgtgg 


acatgaagtg 


a 











<210> 21 
<211> 626 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 6 myc tagged 
homo sapien 

<400> 21 

Met Glu Gin Lys Leu lie Ser Glu Glu Asp Leu Asn Glu Met Glu Gin 
15 10 15 

Lys Leu lie Ser Glu Glu Asp Leu Asn Glu Met Glu Gin Lys Leu lie 
20 25 30 

Ser Glu Glu Asp Leu Asn Glu Met Glu Gin Lys Leu He Ser Glu Glu 
35 40 45 
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Asp Leu Asn Glu Met Glu Gin Lys Leu lie Ser Glu Glu Asp Leu Asn 



Glu Met Glu Ser Leu Gly Asp Leu Thr Met Glu Gin Lys Leu He Ser 



Glu Glu Asp Leu Asn Ser Met Lys Arg Lys Leu Asp His Gly Ser Glu 



Val Arg Ser Phe Ser Leu Gly Lys Lys Pro Cys Lys Val Ser Glu Tyr 
100 105 110 



Thr Ser Thr Thr Gly Leu Val Pro Cys Ser Ala Thr Pro Thr Thr Phe 
115 120 125 



Gly Asp Leu Arg Ala Ala Asn Gly Gin Gly Gin Gin Arg Arg Arg He 
130 135 140 



Thr Ser Val Gin Pro Pro Thr Gly Leu Gin Glu Trp Leu Lys Met Phe 
145 150 155 160 



Gin Ser Trp Ser Gly Pro Glu Lys Leu Leu Ala Leu Asp Glu Leu He 
165 170 175 



Asp Ser Cys Glu Pro Thr Gin Val Lys His Met Met Gin Val He Glu 
180 185 190 



Pro Gin Phe Gin Arg Asp Phe He Ser Leu Leu Pro Lys Glu Leu Ala 
195 200 205 



Leu Tyr Val Leu Ser Phe Leu Glu Pro Lys Asp Leu Leu Gin Ala Ala 
210 215 220 



Gin Thr Cys Arg Tyr Trp Arg He Leu Ala Glu Asp Asn Leu Leu Trp 
225 230 235 240 
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Arg Glu Lys Cys Lys Glu Glu Gly He Asp Glu Pro Leu His He Lys 
245 250 255 



Arg Arg Lys Val He Lys Pro Gly Phe He His Ser Pro Trp Lys Ser 
260 265 270 



Ala Tyr He Arg Gin His Arg He Asp Thr Asn Trp Arg Arg Gly Glu 
275 280 285 



Leu Lys Ser Pro Lys Val Leu Lys Gly His Asp Asp His Val He Thr 
290 295 300 



Cys Leu Gin Phe Cys Gly Asn Arg He Val Ser Gly Ser Asp Asp Asn 
305 310 315 320 



Thr Leu Lys Val Trp Ser Ala Val Thr Gly Lys Cys Leu Arg Thr Leu 
325 330 335 



Val Gly His Thr Gly Gly Val Trp Ser Ser Gin Met Arg Asp Asn He 
340 345 350 



He He Ser Gly Ser Thr Asp Arg Thr Leu Lys Val Trp Asn Ala Glu 
355 360 365 



Thr Gly Glu Cys He His Thr Leu Tyr Gly His Thr Ser Thr Val Arg 
370 375 380 



Cys Met His Leu His Glu Lys Arg Val Val Ser Gly Ser Arg Asp Ala 
385 390 395 400 



Thr Leu Arg Val Trp Asp He Glu Thr Gly Gin Cys Leu His Val Leu 
405 410 415 
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Met Gly His Val Ala Ala Val Arg Cys Val Gin Tyr Asp Gly Arg Arg 
420 425 430 



Val Val Ser Gly Ala Tyr Asp Phe Met Val Lys Val Trp Asp Pro Glu 
435 440 445 



Thr Glu Thr Cys Leu His Thr Leu Gin Gly His Thr Asn Arg Val Tyr 
450 455 460 



Ser Leu Gin Phe Asp Gly He His Val Val Ser Gly Ser Leu Asp Thr 
465 470 475 480 



Ser He Arg Val Trp Asp Val Glu Thr Gly Asn Cys lie His Thr Leu 
485 490 495 



Thr Gly His Gin Ser Leu Thr Ser Gly Met Glu Leu Lys Asp Asn He 
500 505 510 



Leu Val Ser Gly Asn Ala Asp Ser Thr Val Lys He Trp Asp He Lys 
515 520 525 



Thr Gly Gin Cys Leu Gin Thr Leu Gin Gly Pro Asn Lys His Gin Ser 
530 535 540 



Ala Val Thr Cys Leu Gin Phe Asn Lys Asn Phe Val He Thr Ser Ser 
545 550 555 560 



Asp Asp Gly Thr Val Lys Leu Trp Asp Leu Lys Thr Gly Glu Phe He 
565 570 575 



Arg Asn Leu Val Thr Leu Glu Ser Gly Gly Ser Gly Gly Val Val Trp 
580 585 590 



Arg He Arg Ala Ser Asn Thr Lys Leu Val Cys Ala Val Gly Ser Arg 
595 600 605 
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Asn Gly Thr Glu Glu Thr Lys Leu Leu Val Leu Asp Phe Asp Val Asp 
610 615 620 

Met Lys 
625 

<210> 22 
<211> 31 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
Oligonucleotide primer 

<400> 22 

gggtacccct cattattccc tcgagttctt c 31 

<210> 23 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
Oligonucleotide primer 

<400> 23 

ggaattcctt catgtccaca tcaaagtcc 29 

<210> 24 
<211> 2010 
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<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: V5HIS tagged, 
homo sapien 

<400> 24 

atgtgtgtcc cgagaagcgg tttgatactg agctgcattt gcctttactg tggagttttg 6 0 
ttgccggttc tgctccctaa tcttcctttt ctgacgtgcc tgagcatgtc cacattagaa 12 0 
tctgtgacat acctacctga aaaaggttta tattgtcaga gactgccaag cagccggaca 180 
cacgggggca cagaatcact gaaggggaaa aatacagaaa atatgggttt ctacggcaca 240 
ttaaaaatga ttttttacaa aatgaaaaga aagttggacc atggttctga ggtccgctct 3 00 
ttttctttgg gaaagaaacc atgcaaagtc tcagaatata caagtaccac tgggcttgta 3 60 
ccatgttcag caacaccaac aacttttggg gacctcagag cagccaatgg ccaagggcaa 42 0 
caacgacgcc gaattacatc tgtccagcca cctacaggcc tccaggaatg gctaaaaatg 48 0 
tttcagagct ggagtggacc agagaaattg cttgctttag atgaactcat tgatagttgt 54 0 
gaaccaacac aagtaaaaca tatgatgcaa gtgatagaac cccagtttca acgagacttc 600 
atttcattgc tccctaaaga gttggcactc tatgtgcttt cattcctgga acccaaagac 660 
ctgctacaag cagctcagac atgtcgctac tggagaattt tggctgaaga caaccttctc 72 0 
tggagagaga aatgcaaaga agaggggatt gatgaaccat tgcacatcaa gagaagaaaa 78 0 
gtaataaaac caggtttcat acacagtcca tggaaaagtg catacatcag acagcacaga 840 
attgatacta actggaggcg aggagaactc aaatctccta aggtgctgaa aggacatgat 900 
gatcatgtga tcacatgctt acagttttgt ggtaaccgaa tagttagtgg ttctgatgac 9 60 
aacactttaa aagtttggtc agcagtcaca ggcaaatgtc tgagaacatt agtgggacat 102 0 
acaggtggag tatggtcatc acaaatgaga gacaacatca tcattagtgg atctacagat 108 0 
cggacactca aagtgtggaa tgcagagact ggagaatgta tacacacctt atatgggcat 1140 
acttccactg tgcgttgtat gcatcttcat gaaaaaagag ttgttagcgg ttctcgagat 1200 
gccactctta gggtttggga tattgagaca ggccagtgtt tacatgtttt gatgggtcat 12 6 0 
gttgcagcag tccgctgtgt tcaatatgat ggcaggaggg ttgttagtgg agcatatgat 132 0 
tttatggtaa aggtgtggga tccagagact gaaacctgtc tacacacgtt gcaggggcat 13 80 
actaatagag tctattcatt acagtttgat ggtatccatg tggtgagtgg atctcttgat 1440 
acatcaatcc gtgtttggga tgtggagaca gggaattgca ttcacacgtt aacagggcac 1500 
cagtcgttaa caagtggaat ggaactcaaa gacaatattc ttgtctctgg gaatgcagat 15 60 
tctacagtta aaatctggga tatcaaaaca ggacagtgtt tacaaacatt gcaaggtccc 162 0 
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aacaagcatc agagtgctgt gacctgttta cagttcaaca agaactttgt aattaccagc 1680 
tcagatgatg gaactgtaaa actatgggac ttgaaaacgg gtgaatttat tcgaaaccta 1740 
gtcacattgg agagtggggg gagtggggga gttgtgtggc ggatcagagc ctcaaacaca 1800 
aagctggtgt gtgcagttgg gagtcggaat gggactgaag aaaccaagct gctggtgctg 1860 
gactttgatg tggacatgaa ggaattctgc agatatccag cacagtggcg gccgctcgag 1920 
tctagagggc ccttcgaagg taagcctatc cctaaccctc tcctcggtct cgattctacg 1980 
cgtaccggtc atcatcacca tcaccattga 2 010 

<210> 25 
<211> 669 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: V5HIS tagged 
homo sapien 

<400> 25 

Met Cys Val Pro Arg Ser Gly Leu lie Leu Ser Cys lie Cys Leu Tyr 



Cys Gly Val Leu Leu Pro Val Leu Leu Pro Asn Leu Pro Phe Leu Thr 



Cys Leu Ser Met Ser Thr Leu Glu Ser Val Thr Tyr Leu Pro Glu Lys 



Gly Leu Tyr Cys Gin Arg Leu Pro Ser Ser Arg Thr His Gly Gly Thr 



Glu Ser Leu Lys Gly Lys Asn Thr Glu Asn Met Gly Phe Tyr Gly Thr 
65 70 75 80 



Leu Lys Met lie Phe Tyr Lys Met Lys Arg Lys Leu Asp His Gly Ser 
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Glu Val Arg Ser Phe Ser Leu Gly Lys Lys Pro Cys Lys Val Ser Glu 
100 105 HO 



Tyr Thr Ser Thr 
115 

Phe Gly Asp Leu 
130 

lie Thr Ser Val 
145 

Phe Gin Ser Trp 



lie Asp Ser Cys 
180 

Glu Pro Gin Phe 
195 

Ala Leu Tyr Val 
210 

Ala Gin Thr Cys 
225 

Trp Arg Glu Lys 



Lys Arg Arg Lys 
260 



Thr Gly Leu Val 
120 

Arg Ala Ala Asn 
135 

Gin Pro Pro Thr 
150 

Ser Gly Pro Glu 
165 

Glu Pro Thr Gin 



Gin Arg Asp Phe 
200 

Leu Ser Phe Leu 
215 

Arg Tyr Trp Arg 
230 

Cys Lys Glu Glu 
245 

Val lie Lys Pro 



Pro Cys Ser Ala 

Gly Gin Gly Gin 
140 

Gly Leu Gin Glu 
155 

Lys Leu Leu Ala 
170 

Val Lys His Met 
185 

lie Ser Leu Leu 



Glu Pro Lys Asp 
220 

lie Leu Ala Glu 
235 

Gly lie Asp Glu 
250 

Gly Phe lie His 
265 



Thr Pro Thr Thr 
125 

Gin Arg Arg Arg 

Trp Leu Lys Met 
160 

Leu Asp Glu Leu 
175 

Met Gin Val lie 
190 

Pro Lys Glu Leu 
205 

Leu Leu Gin Ala 



Asp Asn Leu Leu 
240 

Pro Leu His lie 
255 

Ser Pro Trp Lys 
270 
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Ser Ala Tyr He Arg Gin His Arg He Asp Thr Asn Trp Arg Arg Gly 
275 280 285 



Glu Leu Lys Ser 
290 

Thr Cys Leu Gin 
305 

Asn Thr Leu Lys 

Leu Val Gly His 
340 

He He He Ser 
355 

Glu Thr Gly Glu 
370 

Arg Cys Met His 
385 

Ala Thr Leu Arg 

Leu Met Gly His 
420 

Arg Val Val Ser 
435 

Glu Thr Glu Thr 



Pro Lys Val Leu 
295 

Phe Cys Gly Asn 
310 

Val Trp Ser Ala 
325 

Thr Gly Gly Val 

Gly Ser Thr Asp 
360 

Cys He His Thr 
375 

Leu His Glu Lys 
390 

Val Trp Asp He 
405 

Val Ala Ala Val 



Gly Ala Tyr Asp 
440 

Cys Leu His Thr 



Lys Gly His Asp 
300 

Arg He Val Ser 
315 

Val Thr Gly Lys 

330 

Trp Ser Ser Gin 
345 

Arg Thr Leu Lys 

Leu Tyr Gly His 
380 

Arg Val Val Ser 
395 

Glu Thr Gly Gin 
410 

Arg Cys Val Gin 
425 

Phe Met Val Lys 



Leu Gin Gly His 



Asp His Val He 



Gly Ser Asp Asp 
320 

Cys Leu Arg Thr 
335 

Met Arg Asp Asn 
350 

Val Trp Asn Ala 
365 

Thr Ser Thr Val 



Gly Ser Arg Asp 
400 

Cys Leu His Val 
415 

Tyr Asp Gly Arg 
430 

Val Trp Asp Pro 
445 

Thr Asn Arg Val 
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Tyr Ser Leu Gin Phe Asp Gly lie His Val Val Ser Gly Ser Leu Asp 
465 470 475 480 



Thr Ser lie Arg Val Trp Asp Val Glu Thr Gly Asn Cys He His Thr 
485 490 495 



Leu Thr Gly His Gin Ser Leu Thr Ser Gly Met Glu Leu Lys Asp Asn 
500 505 510 



He Leu Val Ser Gly Asn Ala Asp Ser Thr Val Lys He Trp Asp He 
515 520 525 



Lys Thr Gly Gin Cys Leu Gin Thr Leu Gin Gly Pro Asn Lys His Gin 
530 535 540 



Ser Ala Val Thr Cys Leu Gin Phe Asn Lys Asn Phe Val He Thr Ser 
545 550 555 560 



Ser Asp Asp Gly Thr Val Lys Leu Trp Asp Leu Lys Thr Gly Glu Phe 
565 570 575 



He Arg Asn Leu Val Thr Leu Glu Ser Gly Gly Ser Gly Gly Val Val 
580 585 590 



Trp Arg He Arg Ala Ser Asn Thr Lys Leu Val Cys Ala Val Gly Ser 
595 600 605 



Arg Asn Gly Thr Glu Glu Thr Lys Leu Leu Val Leu Asp Phe Asp Val 
610 615 620 



Asp Met Lys Glu Phe Cys Arg Tyr Pro Ala Gin Trp Arg Pro Leu Glu 
625 630 635 640 
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Ser Arg Gly Pro Phe Glu Gly Lys Pro lie Pro Asn Pro Leu Leu Gly 
645 650 655 



Leu Asp Ser Thr Arg Thr Gly His His His His His His 
660 665 



<210> 26 
<211> 2001 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: MYCHIS tagged 
homo sapiens 



<400> 26 



atgtgtgtcc 


cgagaagcgg 


tttgatactg 


agctgcattt 


gcctttactg 


tggagttttg 


60 


ttgccggttc 


tgctccctaa 


tcttcctttt 


ctgacgtgcc 


tgagcatgtc 


cacattagaa 


120 


tctgtgacat 


acctacctga 


aaaaggttta 


tattgtcaga 


gactgccaag 


cagccggaca 


180 


cacgggggca 


cagaatcact 


gaaggggaaa 


aatacagaaa 


atatgggttt 


ctacggcaca 


240 


ttaaaaatga 


ttttttacaa 


aatgaaaaga 


aagttggacc 


atggttctga 


ggtccgctct 


300 


ttttctttgg 


gaaagaaacc 


atgcaaagtc 


tcagaatata 


caagtaccac 


tgggcttgta 


360 


ccatgttcag 


caacaccaac 


aacttttggg 


gacctcagag 


cagccaatgg 


ccaagggcaa 


420 


caacgacgcc 


gaattacatc 


tgtccagcca 


cctacaggcc 


tccaggaatg 


gctaaaaatg 


480 


tttcagagct 


ggagtggacc 


agagaaattg 


cttgctttag 


atgaactcat 


tgatagttgt 


540 


gaaccaacac 


aagtaaaaca 


tatgatgcaa 


gtgatagaac 


cccagtttca 


acgagacttc 


600 


atttcattgc 


tccctaaaga 


gttggcactc 


tatgtgcttt 


cattcctgga 


acccaaagac 


660 


ctgctacaag 


cagctcagac 


atgtcgctac 


tggagaattt 


tggctgaaga 


caaccttctc 


720 


tggagagaga 


aatgcaaaga 


agaggggatt 


gatgaaccat 


tgcacatcaa 


gagaagaaaa 


780 


gtaataaaac 


caggtttcat 


acacagtcca 


tggaaaagtg 


catacatcag 


acagcacaga 


840 


attgatacta 


actggaggcg 


aggagaactc 


aaatctccta 


aggtgctgaa 


aggacatgat 


900 


gatcatgtga 


tcacatgctt 


acagttttgt 


ggtaaccgaa 


tagttagtgg 


ttctgatgac 


960 


aacactttaa 


aagtttggtc 


agcagtcaca 


ggcaaatgtc 


tgagaacatt 


agtgggacat 


1020 
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acaggtggag 


tatggtcatc 


acaaatgaga 


gacaacatca 


tcattagtgg 


atctacagat 


1080 


cggacactca 


aagtgtggaa 


tgcagagact 


ggagaatgta 


tacacacctt 


atatgggcat 


1140 


acttccactg 


tgcgttgtat 


gcatcttcat 


gaaaaaagag 


ttgttagcgg 


ttctcgagat 


1200 


gccactctta 


gggtttggga 


tattgagaca 


ggccagtgtt 


tacatgtttt 


gatgggtcat 


1260 


gttgcagcag 


tccgctgtgt 


tcaatatgat 


ggcaggaggg 


ttgttagtgg 


agcatatgat 


1320 


tttatggtaa 


aggtgtggga 


tccagagact 


gaaacctgtc 


tacacacgtt 


gcaggggcat 


1380 


actaatagag 


tctattcatt 


acagtttgat 


ggtatccatg 


tggtgagtgg 


atctcttgat 


1440 


acatcaatcc 


gtgtttggga 


tgtggagaca 


gggaattgca 


ttcacacgtt 


aacagggcac 


1500 


cagtcgttaa 


caagtggaat 


ggaactcaaa 


gacaatattc 


ttgtctctgg 


gaatgcagat 


1560 


tctacagtta 


aaatctggga 


tatcaaaaca 


ggacagtgtt 


tacaaacatt 


gcaaggtccc 


1620 


aacaagcatc 


agagtgctgt 


gacctgttta 


cagttcaaca 


agaactttgt 


aattaccagc 


1680 


tcagatgatg 


gaactgtaaa 


actatgggac 


ttgaaaacgg 


gtgaatttat 


tcgaaaccta 


1740 


gtcacattgg 


agagtggggg 


gagtggggga 


gttgtgtggc 


ggatcagagc 


ctcaaacaca 


1800 


aagctggtgt 


gtgcagttgg 


gagtcggaat 


gggactgaag 


aaaccaagct 


gctggtgctg 


1860 


gactttgatg 


tggacatgaa 


ggaattctgc 


agatatccag 


cacagtggcg 


gccgctcgag 


1920 


tctagagggc 


ccttcgaaca 


aaaactcatc 


tcagaagagg 


atctgaatat 


gcataccggt 


1980 


catcatcacc 


atcaccattg 


a 








2001 



<210> 27 
<211> 666 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: MYCHIS tagged 
homo sapiens 

<400> 27 

Met Cys Val Pro Arg Ser Gly Leu lie Leu Ser Cys lie Cys Leu Tyr 
15 10 15 

Cys Gly Val Leu Leu Pro Val Leu Leu Pro Asn Leu Pro Phe Leu Thr 
20 25 30 

Cys Leu Ser Met Ser Thr Leu Glu Ser Val Thr Tyr Leu Pro Glu Lys 
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Gly Leu Tyr Cys Gin Arg Leu Pro Ser Ser Arg Thr His Gly Gly Thr 



Glu Ser Leu Lys Gly Lys Asn Thr Glu Asn Met Gly Phe Tyr Gly Thr 



Leu Lys Met lie Phe Tyr Lys Met Lys Arg Lys Leu Asp His Gly Ser 



Glu Val Arg Ser Phe Ser Leu Gly Lys Lys Pro Cys Lys Val Ser Glu 
100 105 110 



Tyr Thr Ser Thr Thr Gly Leu Val Pro Cys Ser Ala Thr Pro Thr Thr 
115 120 125 



Phe Gly Asp Leu Arg Ala Ala Asn Gly Gin Gly Gin Gin Arg Arg Arg 
130 135 140 



He Thr Ser Val Gin Pro Pro Thr Gly Leu Gin Glu Trp Leu Lys Met 
145 150 155 160 



Phe Gin Ser Trp Ser Gly Pro Glu Lys Leu Leu Ala Leu Asp Glu Leu 
165 170 175 



He Asp Ser Cys Glu Pro Thr Gin Val Lys His Met Met Gin Val He 
180 185 190 



Glu Pro Gin Phe Gin Arg Asp Phe He Ser Leu Leu Pro Lys Glu Leu 
195 200 205 



Ala Leu Tyr Val Leu Ser Phe Leu Glu Pro Lys Asp Leu Leu Gin Ala 
210 215 220 
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Ala Gin Thr Cys Arg Tyr Trp Arg lie Leu Ala Glu Asp Asn Leu Leu 
225 230 235 240 

Trp Arg Glu Lys Cys Lys Glu Glu Gly He Asp Glu Pro Leu His He 
245 250 255 

Lys Arg Arg Lys Val He Lys Pro Gly Phe He His Ser Pro Trp Lys 
260 265 270 

Ser Ala Tyr He Arg Gin His Arg He Asp Thr Asn Trp Arg Arg Gly 
275 280 285 

Glu Leu Lys Ser Pro Lys Val Leu Lys Gly His Asp Asp His Val He 
290 295 300 

Thr Cys Leu Gin Phe Cys Gly Asn Arg He Val Ser Gly Ser Asp Asp 
305 310 315 320 

Asn Thr Leu Lys Val Trp Ser Ala Val Thr Gly Lys Cys Leu Arg Thr 
325 330 335 

Leu Val Gly His Thr Gly Gly Val Trp Ser Ser Gin Met Arg Asp Asn 
340 345 350 

He He He Ser Gly Ser Thr Asp Arg Thr Leu Lys Val Trp Asn Ala 
355 360 365 

Glu Thr Gly Glu Cys He His Thr Leu Tyr Gly His Thr Ser Thr Val 
370 375 380 

Arg Cys Met His Leu His Glu Lys Arg Val Val Ser Gly Ser Arg Asp 
385 390 395 400 

Ala Thr Leu Arg Val Trp Asp He Glu Thr Gly Gin Cys Leu His Val 
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405 410 415 

Leu Met Gly His Val Ala Ala Val Arg Cys Val Gin Tyr Asp Gly Arg 
420 425 430 

Arg Val Val Ser Gly Ala Tyr Asp Phe Met Val Lys Val Trp Asp Pro 
435 440 445 

Glu Thr Glu Thr Cys Leu His Thr Leu Gin Gly His Thr Asn Arg Val 
450 455 460 

Tyr Ser Leu Gin Phe Asp Gly lie His Val Val Ser Gly Ser Leu Asp 
465 470 475 480 

Thr Ser lie Arg Val Trp Asp Val Glu Thr Gly Asn Cys lie His Thr 
485 490 495 

Leu Thr Gly His Gin Ser Leu Thr Ser Gly Met Glu Leu Lys Asp Asn 
500 505 510 

lie Leu Val Ser Gly Asn Ala Asp Ser Thr Val Lys lie Trp Asp lie 
515 520 525 

Lys Thr Gly Gin Cys Leu Gin Thr Leu Gin Gly Pro Asn Lys His Gin 
530 535 540 

Ser Ala Val Thr Cys Leu Gin Phe Asn Lys Asn Phe Val lie Thr Ser 
545 550 555 560 

Ser Asp Asp Gly Thr Val Lys Leu Trp Asp Leu Lys Thr Gly Glu Phe 
565 570 575 

lie Arg Asn Leu Val Thr Leu Glu Ser Gly Gly Ser Gly Gly Val Val 
580 585 590 
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Trp Arg lie Arg Ala Ser Asn Thr 
595 600 

Arg Asn Gly Thr Glu Glu Thr Lys 
610 615 

Asp Met Lys Glu Phe Cys Arg Tyr 
625 630 

Ser Arg Gly Pro Phe Glu Gin Lys 
645 

Met His Thr Gly His His His His 
660 



Lys Leu Val Cys Ala Val Gly Ser 
605 



Leu Leu Val Leu Asp Phe Asp Val 
620 



Pro Ala Gin Trp Arg Pro Leu Glu 
635 640 



Leu lie Ser Glu Glu Asp Leu Asn 
650 655 



His His 
665 
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